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1 

KETHOD FOR SITE-SI^CXFXCAIJjY 
INCORPORATING UNNATURAL AMINO ACIDS INTO PROTEINS 

5 

This is a continuation-in-part application of 
commonly assigned patent applications U.S. S.N. 273,4 55 and 
U.S. S.N. 273,786, both filed November 18, 1988, which are 
incorporated herein by reference. 
10 This invention was made in part with government 

support under ONR Contract N00014-86-K-0522, awarded by the 
Office of Naval Research, and DOE grant No. DE AC03- 
76SF00098. The government may have certain rights in this 
invention. 

15 

Field of the Invention 

This invention relates generally to protein 
biochemistry and, more particularly, to site specific 
modification of proteins generally useful for controlling 
20 specificity and activity of enzymes, and for altering the 
natural structural properties of proteins. 

BACKGROUND OF THE INVENTION 

Classically, biochemists have modified protein 
25 function by either chemically altering isolated proteins or 
by selecting naturally occurring variants. Chemical 
modification is typically directed to unusually reactive and 
solvent accessible eunino acid side chains, but often the 
desired modification sites are not accessible or the 
30 modifications of interest are not chemically feasible. Low 
specificity of modification reactions dramatically hinders 
the usefulness of this approach. Moreover, naturally 
occurring variants are rare, usually require substantial 
analysis to determine the nature of any variation, eoid are 
35 generally limited to substitutions with naturally occurring 
amino acid residues. 

With the advent of molecular biology technology, 
other approaches have been developed which allow, albeit '^in a 
limited manner, particular amino acids to be incorporated 
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int:o pro'teins or peptides during the process of peptide bond 
polymerization. Peptide synthesis and semisynthetic methods 
have been used to introduce novel amino acids into very small 
proteins and peptides. Modified amino acids have been 
5 uniformly incorporated into peptides and proteins using 

functional analogues of aminoacyl transfer I^A*s (tRNA's) . 
Xn addition, several unnatural amino acids have been 
incorporated into dipeptides using chemically misacylated 
tRNA's. 

10 All Of these methods suffer from one or more of the 

following drawbacks? lack of site specificity in the 
introduction of the novel amino acid; heterogeneity in sites 
of modification; low efficiency in modifications; a 
requirement for extensive characterization to determine 

15 precisely the location and nature of any substitution; severe 
size restrictions on the protein of interest and very limited 
possibilities for substitutions or modifications. Thus,, 
there exists a need for improved methods for generating 
proteins with specific modifications at desired locations. 

20 The present invention fulfills these and other needs. 

SUMMARY OF THE INVENTION 
In accordance with the present invention, novel 
methods are provided for site specifically incorporating an 
25 unnatural amino acid analogue into a protein, the methods 
comprising the steps of: 

(a) introducing a preselected codon into at least 
one site in a mRNA sequence encoding the 
protein; and 

30 (b) translating the mRNA sequence in a protein 

synthesizing system comprising an aminoacyl 
tRNA analogue capable of polymerizing the 
uzmatural amino acid analogue into a nascent 
polypeptide chain on direction of the 
35 preselected codon. 

The protein synthesizing system is preferably an in vitro 
protein synthesizing system, and the preselected codon a 
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t:enninat:ion codon, such as UAG (amber) , inserted at 
predetermined sites. 

Tlie unnatural amino acid analogue will typically be 
selected from modified natural amino acids, modified 
5 uncharged amino acids, modified acidic amino acids, modified 
basic amino acids, non-alpha amino acids, amino acids with 
altered tlf, <p angles, and amino acids containing functional 
groups selected from the group of nitro, amidine, 
hydroxylamine , guinone, aliphatic, cyclic and unsaturated 
10 chemical groups. Preferably, the aminoacyl tRNA analogue is 
the only aminoacyl tRNA molecule in the protein synthesizing 
system capable of recognizing the preselected codon and the 
preselected codon is introduced into one site of the mRNA 
sequence encoding the protein which typically has a molecular 
15 weight greater than about ten thousand daltons. The 

unnatural amino acid analogue may be situated within about 
100 angstroms of a substrate binding site, an enzymatic 
active site, a protein-protein interface, a cof actor binding 
site, or a ligand (agonist or antagonist) binding site. 
20 Another aspect of the present invention includes 

novel proteins (usually greater than about 10 Kd) that are 
stoichiometrically siibstituted at one or more predetermined 
sites, preferably substantially homogeneously, with an 
unnatural amino acid analogue. Analyzing the physical or 
25 biochemical properties of the protein can determine various 
properties, such as static physical properties of the 
polypeptide chain, mechanism of action of an enzymatic 
reaction, specificity of protein binding to ligand, dynamic 
interaction of amino acid residues of a stibject protein with 
30 a substrate, folding of the protein, or interaction of the 
proein with other proteins, with nucleic acids or with 
sugars. Typically, the protein is analyzed within about 100 
angstroms of the unnatural amino acid analogue insertion. 

In another aspect, the present invention provides 
35 methods for making multiple alternative substitutions at 

preselected amino acid positions of a protein comprising the 
steps of: 
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a) producing one mRNA wlUi mis-translation codons 
at sites in the mRNA corresponding to the 
preselected amino acid posi-tions; and 

b) translating the mRNA in a series of two or 
5 more translation systems each comprising an 

aminoacyl tRNA analogue, whereby the protein 
produced by one translation system differs 
from the protein produced by another system at 
the preselected amino acid position. 
10 Preferably, one amino acid position is substituted 

and the difference between proteins produced by the different 
translation systems is predetermined by the preselection of 
unnatural amino acid analogues attached to the aminoacyl 
tRNAs. The unnatural amino acid sxibstitution may be, for 
15 example, D-phenylalanine, (S) --p-nitrophenylalanine, (S)- 

hombphenylalanine , (S ) -p-f luorophenylalanine , ( S ) -3 -amino-2 - 
benzylpr op ionic acid, or (S)-2-hydroxy-3-phenylpropionic 
acid. 

Yet another aspect of the present invention relates 
20 to methods for producing an aminoacyl tRNA analogue molecule 
comprising the steps of: 

a) attaching a predetermined unnatural amino acid 
analogue by an aminoacyl linkage at 2 * or 3 ■ 
ribosyl hydroxyl positions on the 3' terminal 

25 nucleotide of a multi-nucleotide molecule 

(MNM) ; and 

b) ligating the aminoacyl -multi-nucleotide 
molecule (aminoacyl-MNM) to a truncated tRNA 
molecule (tRNA(-Z)), wherein a functional 

30 aminoacyl tRNA analogue molecule is formed. 

Preferably, nucleotide molecule (MNM) , which may be a 
dinucleotide such as 5'-p^A-3', corresponds to a tRNA 3* 
terminus. The ligation of the multi-nucleotide molecule 
(MNM) to the tRNA(-Z) molecule typically crenerates a complete 

35 tRNA molecule,, and the tRNA(-2) may be derived from a run-off 
transcript. 

The attaching of the predetermined unnatural amino 
acid analogue by an aminoacyl linkage at 2 or 3 • ribosyl 
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hydroxyl positions on the 3 • terminal nucleotide of a multi- 
nucleotide molecule (MNM) is preferably accomplished by the 
steps of: 

a) protecting reactive chemical groups of the MNM 
5 with protective agents; 

b) protecting reactive non^aminoacyl reactive 
groups of the amino acid analogue with a 
blocking agent; 

c) acylating the MNM with a blocking agent- 
10 protected cunino acid analogue; and 

d) removing the protective agents and blocking 
agents from the protected reactive sites. 

Some or all of the reactive group protecting steps are 
substituted with steps using blocking or protective agents 

15 selected from the group consisting of: o-nitrophenylsulf enyl 
(NSP) ; )3-cyanoethyl (EtCNO) ; benzylbxycarbonyl (CBZ) ; 9- 
f luoreny lauaethy loxycarbonyl ( FMOC ) ; 2 - ( 4 -bipheny 1 ) 
isopropyloxycarbonyl (BPOC) ; vinyl oxycarbonyl (VOC) ; 
tetrahydropyranyl (THP) ; methoxytetrahydropyranyl ; and 

20 photolabile groups, including 4-methoxy-2- 

nitrobenzyloxycarbamates (NVOC) . The protecting steps may be 
performed using o-nitrophenylsulf enyl (NPS) for both the 
blocking agents and protective agents, and the ligating of 
the aminoacyl-MNM to the tRNA(-Z) may be performed by the 

25 enzyme T4 RNA ligase. 

Another aspect of the present invention comprises 
aminoacyl tSNA analogues having the formula X A Y - M, 
wherein: 

X 5" nucleotide sequence of a tRNA molecule; 
30 A = anticodon nucleotides; 

Y = 3* nucleotide sequence of a tRNA molecule, 
such as 5 ' -pCpCpA-3 • ; 

M ^ amino acid analogue selected from the 
group consisting of: 
35 i) modified uncharged natural 

amino acids; 
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II) modified acidic na-kiiral amino 
acids; and 

III) non-alpha amino acids. 

These analogues will be able to direct the polymerization of 
5 the M component Into a nascent polypeptide chain and can 

serve as an acceptor for further peptide polymerization. For 
example, an analogue corresponding to t^NA^^^^®^ 
amlnoacylated with (S) -p-nitrophenylalanlne ccoi be produced, 
wherein: 

lb a) X comprises the 5 • segment of tRNA^^^^®^ 

containing a "D loop" and part of an 
"antlcodon loop"; 
b) A (antlcodon) comprises the 
trinucleotide 5 ' -pCpUpA-3 * ; 
15 c) y comprises the 3» segment of tRNA^^^^^^^ 

containing part of ah "antlcodon loop", a 
"variable loop", a "T*C loop", and an 
"acceptor stem"; and 
d) K is (S)-p-nltrophenylalajilne. 
20 The present invention further Includes translation 

systems comprising such aminoacyl tSNK analogues. Also 
Included is a coupled transcription and translation system, 
wherein products , of the transcription system are translated 
by a translation system comprising such aminoacyl tBNA 
25 analogues. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 Is a schematic representation of the 
method for introducing unnatural amino acids site 
30 specifically into proteins. 

Figure 2 shows schemes for synthesizing 
2UDlnoacylated pC^A. 

Figure 3 shows the construction of the plasmld 
pSG7, the vector for in vitro expression of p-lactaxaase. 
35 Segment a is the 259-bp BajnHI-^coRI fragment of pKKK223-3 
(Broslus and Holy, (1984) Proc. Natl. Acad. Sci. USA . 
81:6929), containing the tac promoter. Segment b is the 376- 
bp SspX (llnkered to £coRI-PiruI fragment of pTG2dell 
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(Kadonaga et alo, (1984) J. Biol, Chem. 259s2149) containing 
the first part of the gene for RTEM ^-lactamase (Sutcliffe, 
(1978) Proc. Natl, Acad, Scl, USA , 75s3737; Ambler and Scott, 
(1978) Proc, Natl, Acad. Sci. USA . 75:3732? and Pollitt and 
5 Zalkin^ (1983) J- Bacteriol . 153s 27) with a 63-bp deletion 
corresponding to 21 amino acids in the leader sequence. 
Segment c is the 1386-bp PvuX-^HaelT fragiaent of pT7-3 (Tabor 
and. Richardson, (1985) Proc. Natl, Acad> Sci, USA. 82s 1074), 
containing the remainder of the /9-lactamase gene and the 
10 ColEl origin of replication « Segment d is the 1430-bp Haell- 
ITaeJ fragment of pGPl-2 (Tabor and Richardson, (1985) Proc. 
Natl. Acad. Sci, USA ^ 82s 1074), containing the kanamycin 
resistance gene from Tn903 (Oka et alo, (1981) Jo Mol, Biol, 
147s 217) • This gene is oriented so as not to be iinder the 
15 transcriptional control of the tac promoter. Segment e is 

the 289«=-bp ffaell-PvuII (ligated to the blimt-ended BamHI site 
of segment a to regenerate only the BamHI site) fragment from 
pT7"3o 

Mutants at Phe66 (^on figure) were generated using 
20 the method of Eckstein (Nakamaye and Eckstein, (1986) Nucl. 
Acids. Res. . 14 5 9679). The 204-bp £coRI=fflncII fragment, 
containing the codon for Phe66, was cloned into M13mpl8. 
Three synthetic oligodeoxymucleotides, 5'- 

ATCATTGGATAACGTTCTT-3 ■ , 5 " -ATCATTGGAGCACGTTCTT- 3 • , and 5 ■ - 

25 ATCATTGGCTAACGTTCTT-'3 • (underlined bases denote mismatches to 

the wild-type sequence) were used to generate the F66Y, F66A, 

and F66am mutants, respectively. Mutagenesis efficiencies 

were 100% for Tyr66, 83% for Ala66, and 60% for TAG660 

Figure 4 shows the in vitro synthesis and 

30 purification of truncated i9-lactamase. Xane Is Crude in 

vitro reaction; Lane 2s Purified ^-lacatamase synthesized in 

vitro g Lane 3 2 Purified /9-lactamase synthesized in vivo 

(JM101/PSG7) . 

Figure 5 shows the tRNA PHE/CUA (--CA) sequence as 

P h 6 

35 determined by the enzymatic method. The tRNA^^^(-CA) was 
3' end--labelled with [5 ° -"^^P] -pCp and the sequence was 
determined by the enzymatic method (Donis=-Keller , (1980) 
Nucl. Acids. Res. , 8s3133)o The products of the digestion 
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reacrtions were loaded 0111:0 a 10% dena'turing polyacrylamlde 
gel. An aut:oradiogrsua of the sequencing gel is sliown: no 
enzyme (lane 1)^ -OH digest (lanes 2 and 7), RNase Tl (G- 
specific^ lane 3), RNase U2 (A-specific, lane 4), RNase Phy M 
5 (U+A-specif ic, lane 5) RNase B. cereus (U+C-specif ic, lane 
6) • The sequence of the anticodon stem and loop is shown, 
the site of CUAA incorporation is indicated by larger 
letters. tRNA^^^jj^^(-CA) was either purified by preparative 
gel electrophoresis and used in chemical misacylation 

IQ reactions, or treated with nucleotidyl trcuisf erase (Cudney 

and Deutscher, (1986) J, Biol, Chem, . 261:6450), gel-purified 
and used in misacylation reactions with yeast PRS. 

Figure 6 shows the test of acylated and nonacylated 
suppressor tRNA in vitro . Reactions (30mIi} were carried out 

15 as described in Figure 4, cooled to 0*C and centrifuged. 

Three fiZt of each supernatant was denatured and loaded onto a 
12.5% SDS polyacrylamide gel (Laemmli, (1970) Nature 
f London) ^ 227:680), which was dried and autoradiographed 
following electrophoresis. Lane 1: Reaction primed with pSG7 

20 (truncated j^-lactamase) ; i;ane 2: Reaction primed with pF66am, 
with no added suppressor; Lane 3: Reaction primed with 
pF66am, with non-acylated suppressor (5/ig) added. Lanes 1, 2 
and 3 were supplemented with £-^H]-Phe (Amersham) to a final 
specific activity of 190 Ci/mol; Lane 4: Reaction primed with 

25 pF66am and 5/xg suppressor that had been acylated 

enzymatically with [-^HJ-Phe (specific activity 9.4 Ci/mmol 
Phe-tRNA) . 

Enzymatic misacylation reactions (300 fila total 
volume) contained the following: 4 /xM tRNA^^'^^®^ (30 ^g, 

30 which had been desalted and lyophilized following gel 

purification), 80 /m phenylalanine, 40 mM Tris-HCl (pH.8.5), 
15 mM MgCl2, 45 /xg/ml. BSA, 3.3 mM DTT, 2 mM ATP and 22 Units 
yeast PRS (where i unit activity incorporates 100 pmol Phe in 
2 minutes at 37 "C imder the following conditions: 2 fM 

35 tRNA- ^ (Boehringer Mannheim) , 2 mM ATP, 3.3 mM DDT, 8.1 ^iM 

Phe, 40 mM Na HEPES (pH 7.4) , 15 mM MgCl2/ 25 mM KCl, and 50 
fig/wla BSA) . The reaction mixture was incubated at 37 •C for 3 
minutes, then quenched by addition of 2.5 M NaOAc (pH 4.5) to 
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10% v/v. The quenched reaction was immediately extracted 
with phenol (pre-eguilibrated with 0.25 M NaOAc, pH 4.5)^ 
phenol:CHCl3(l:l) , CHCI3, then precipitated with EtOH. The 
extraction and precipitation were repeated twice more. The 
5 tRNA was then desalted on a Pharmacia fast desalting column 
and lyophilized. The lyophilized mixture of acylated and 
non-acylated tHNA was stored at -80**C until immediately prior 
to its use in Iq vitro protein synthesis reactions. 

i9-Lactamase activity in the supematants of these 

10 reactions was measured using the nitrocefin hydrolysis assay 
(O'Callaghan et al., (1972) Antimicrob> Aa. Chemother. > 
1:283. One nitrocefin hydrolysis unit (1 ^mole nitrocefin 
hydrolyzed/min/mL, 0.1 mM nitrocefin, 50 mM phosphate buffer, 
pH 7) corresponds to 0.61 /tig enzyme, as determined by 

15 Bradford assay. )• 

Reaction ua/mL 

1 pS67 44«6 

2 pF66am 0 
20 3 pF66am, non-acylated 

suppressor 0 
4 pF66am, acylated 

suppressor 6 • 7 



25 Figure 7 shows the method of chemical 

aminoacylation of the dinucleotide pCpA. The dinucleotide 
pCpA was prepared by standard solution phase phosphotriester 
synthesis (Jones et al., (1980) Tetrahedron , 36:3015; Van 
Boom and Wreesman in "Oligonucleotide Synthesis", Gait (Ed.), 

30 IRL Press, Washington, 1984) • The fully protected molecule 
was 4-chlorophenyl-4-N-anisoyl-2 • -O-tetrahydropyranyl-5 ' -0- 
[/3-cyanoethyloxyphosphoryl] cytidylyl (3«-5»)-[6-N, 6-N, 2*- 
0, 3 •-0-tetrabenzoly] adenosine. Then o-Nitrophenylsulf enyl 
chloride (1.8 mmol) and triethylamine (1.8 mmol) were added 

35 over six hours to pCpA (285 /xmol) dissolved in 

dimethylsulf oxide (68 mL) . The reaction was quenched by 
addition of 50mM ammonium acetate, pH 5 (100 mL) and the 
solvent was removed in vacuo . Purification by reverse phase 
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HPLC (HPLC conditions: A = 5 mM anmonitm acetate pH 5, B = 
HeCM^, gradient = 0 to 15% B in 60 min. ; 15 to 30% B in 30 
min. flovr rate = 8 mL/min. , colimn = Whatman Partisil 10 M-20 
10/50 ODS-3.) yield NPS-pCpA in 65% yield and recovered pCpA 
5 in 22% yield. NPS-pCpA was desalted by reverse phase HPLC 

(HPLC conditions: A = ^2^^ ® ^ MeCN, gradient = 0 to 15% B in 
60 min. , 15 to 30% B in 30 min. , flow rate = 8 mL/min. , 
column = Whatman Partisil 10 M-20 10/50 ODS-3.) followed by 
passage through a Dowex column (Li+ form) • o- 

10 Nitrophenylsulfenylphenylalanine (74 imol) and N,N«- 

carbonyldiimidazple (83Minol) were stirred under nitrogen in 
anhydrous dimethylsulf oxide (320 /iL) for thirty minutes. The 
solution was then added to the lithium salt of NPS-pCpA (15 
^-mol, dried by repeated co-evaporation with toluene) . The 

15 reaction was stirred under nitrogen at 50 'C for eight hours, 
then quenched at 0 *C by addition of '50 mM ammonium acetate, 
pH 5 (2 mL) . Lyophilization followed by reverse phase HPLC 
(HPLC conditions: A = 50 mM ammonium acetate, B = MeCN, 
. gradient = O to 70% B in 70 min. , flow rate = 8 mL/min. , 

20 colimn = Whatman Partisil 10 M-20 10/50 ODS-3.) provided the 
desired product in 16% yield with 38% starting material being 
recovered. After lyophilization, the product (2.4 imo\) was 
dissolved in 40 mM sodium thiosulfate, 50 mM sodium acetate, 
pH 4.5 (2 mL) and stirred for one hour. Reverse phase HPLC 

25 (HPLC conditions: A 8 mM HOAc, B = MeCN, gradient == 0 to 
30% B in 45 min., flow rate = 4 mL/min., column = Whatman 
Partisil lO M9/50 ODS^3 . ) afforded the deprotected acyl pCpA 
in 81% yield. All products were characterized by UV, NMR and 
2D NMR. 

30 Figure 8 shows purification of Phe66 iS -lactamase 

synthesized according to Figure 1. i9-lactamase was purified 
from 900 /iL pF66am-primed reaction that had been supplemented 
with 150 Mg chemically acylated Phe-tRNA^^ following the 
procedure described in Figure 4, Typical yields were 0.3-0.7 

35 iiq (7-15%) of purified enzyme, starting from 4.5 iiq in crude 
reaction. 

Samples (50-200 ng/band) were electrophoresed on a 
12.5% SDS-polyacrylamide gel (Laemmli, (1970) Nature 
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(London^ . 227:680), which was subseguent:ly silver stained. 
The diffuse bands at M2.-67000 and 60000 are artifacts 
commonly observed during high sensitivity silver staining 
(Merril et al., (1983) Methods Enzvmol . ^ 96:230). Lane 1: 
5 Crude in vitro reaction; Lane 2: Purified in vitro /9- 

lacteuaase; Lane 3: Purified in vivo iS-lactamase (JM101/pS67) . 

Figure 9 shows tryptic peptide mapping of tryptic 
digest and peptide mapping of wild-type and suppressed 
lactamase. Wild-type ^-lactamase was xiniformly labelled with 

10 [H] -phenylalanine by in vitro protein synthesis from pSG7 in 
the presence of added [H] -phenylalanine. Non-labelled ^- 
lactamase was added to the products of the in vitro synthesis 
prior to purification of the enzyme by gel filtration on 
sephadex G-75 (Pharmacia) and chroma to focusing chromatography 

15 as described in Figure 4. The sequence of /S-lactamase from 
pSG7 is shown, trypsin cleavage sites are indicated by 
spaces, peptides containing Phe are indicated by bold type, 
Phe 66 is underlined: MSHPETLVK VK DAEDQLGAR VGYIELDLNSGK 
JLESFRPEER FPMMSTFK VLLCGAVLSR VDAGQEQLGR R IHYSQNDLVEYSPVTEK 

20 HLTDGMTVR EXiCSAAlTKSDKTAANLLLTTXGGPK EI/rAFi;HNHGDHVTR LDR 
WEPELNEAIPNDER DTTMPAAMATTLR K LLTGELLTLASR QQLIDWMEADK 
VAGPIJliR SALPAGWFXADK SGAGER GSR GHAALGPDGKPSR 
IWIYTTGSQATMDER NR QIAEIGASLIK HW. The tryptic peptides 
were separated using a Pharmacia Pep RPC 5/5 column. The 

25 absorbance of the col\imn effluent was monitored at 254 nm 

(panel A) and fractions of 0.5 mL were collected and counted 
(panel B) • Radioactive suppressed /S-lactcuaase was 
synthesized In vitro from pF66affl in the presence of added 
[^H3-Phe, tRNA^c^u\* purification and trypsin digestion 

3 0 of the suppressed ^-lactamase were identical except that only 
6,000 CPM of labelled suppressed enzyme were used in 
digestion reactions (panel C) . 

Figure 10 shows the characterization of native and 
mutant /3-lactamase. Wild-type and Phe 66-suppressed ^- 

35 lactamase were purified to homogeneity from ImL in vitro 
reactions primed with pSG7 and pF66ain/Phe-tRNA^^, 
respectively. Initial rates of nitrocefin hydrolysis were 
determined, at 24**C in 50 mM sodium phosphate, pH 7/0.5% 
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DMSOr substratie concentra-bions ranging £roia 25-250 /xM. 
and V^^^ values were obt;ained from £adie-Hofst:ee plots, and 
Bradford assay guantita-kions of tti& enzymes were used -to 
determine k^^^ values. 
5 Kinetic parameters for the mutant enzymes were 

determined as follows: In vitro reactions (6 0^x1) containing 
15 /iGi[^^S] -methionine (Amersham) were primed with pF66y, 
pF66am/Phe-tRNA^^^U®j^, pF66aia/p-FPhe-tIWA^C^^®j^rPF66am/ 
Phe-tRlTA^^*^^\, pF66am/HPhe-tRNA^^^^\. pF66a2a/PLA- 

10 tRNA^^^^^^, pF66am/ABPA-tRNA^^^u\' pF66am/D-Phe- 

tRNA^^*^^^^ and incubated at 37*0 for 30 min. and V^^^ 
values were determined as described above using crude enzyme 
immediately following incubation. Quantitation of the 
enzymes was achieved by first precipitation the crude 

15 reaction with trichloroacetic acid and washing at 90 "C to 
remove unincorporated label (Pratt in "Transcription and 
Translation", Hanes and Higgins (Eds.)f Press, Oxford, 

1984, pp. 179-209). The measured incorporated radioactivity 
for the Phe66 enzyme was then used, together with the k^^t 

20 88 Os*^ determined from Bradford assay quantitation, to 

calculate the amount of incorporated radioactivity/Mmol of 
enzyme (typically 5 mCi/Mmol) • This value was then used to 
guantitate the mutant enzymes, based on the incorporated 
radioactivity measured for each. Values shown are the 

25 averages of three determinations. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT 

The present invention provides novel methods for 
synthesizing proteins containing unnatural amino acids at 

30 specific sites The methods preferably utilize modified 
amlnoacyl tRNA's capable of polymerizing the desired 
unnatural amino acid(s) at unique codon(s) within an mRNA 
sequence. Utilizing these methods, a wide variety of 
xmnatural amino acids may be selectively introduced into 

35 proteins of interest. The methods can provide proteins which 
are substantially homogeneously substituted at selected sites 
in stoichiometric amounts. The procedures are inherently 
simple and allow control of both the type and site o£ 
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modifications to a protein molecule within a virtually 
limitless number of possible variations. 

One aspect of the invention relates to the 
production of modified tRNA molecules and their use in 
5 producing desired proteins as follows: 

a) preparing a nucleic acid sequence capsO^le of 
being translated into a desired polypeptide, the nucleic acid 
sequence including at least one codon which will be dedicated 
to a desired preselected amino acid substitution within the 

10 polypeptide; 

b) obtaining or synthesizing an aminoacyl tRNA 
analogue which will recognize the dedicated codon and 
function as an adaptor molecule to direct the polymerization 
of the amino acid substitution into the polypeptide; 

15 c) combining the nucleic acid sequence with a 

protein translation system containing the aminoacyl tRNA 
analogue, whereby the translation system will function to 
normally translate the nucleic acid message, except that the 
aminoacyl tPNA analogue will direct the incorporation of the 

20 eunino acid substitution for the otherwise naturally occurring 
corresponding natural amino acid; and 

d) allowing the translation system to function so 
the sequence will be translated and the system will 
substitute at the direction of the selected codon the 

25 corresponding predetermined amino acid analogue into the 
resultant protein. 

Although each of these steps relate to important 
parts of the invention, various modifications will be readily 
apparent to one skilled in the art: to adapt the procedures to 

30 particular specific uses as detailed below. 

Proteins are fundaunental building blocks of living 
organisms and serve multiple functions. Typically they serve 
structural functions, catalytic (or enzymatic) functions or a 
mixture of the two. Proteins are synthesized on ribosomes 

35 which polymerize polypeptide chains out of a set of 2 0 common 
amino acid monomers according to the information contained in 
the sequence of nucleotides making up the "messenger RNA" 
(mRNA) . The mRNA is "translated" by the ribosomes which 
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"read" three-nucleotide segments (one codon) at a time. From 
a particular AUG, or initiation codon ^ the ribosomes read 
successively in three-nucleotide segments establishing the 
. "frame" of translation. 
5 RNA is composed of 4 different types of nucleotides 

containing the adenine, cytosine, guanine and uracil. In 
three-nucleotide segments (codons) , there are 64 possible 
sequence combinations , three of which normally will not 
translate and result in termination of further polypeptide 

10 elongation* These three codons, UAG, UAA and UGA, are the 
normal termination codons. The other 61 codons have 
corresponding adaptor molecules (tRNA's) which recognize (or 
. match) the message codon by complementarily matching with 
these bases. The complementary sequence is contained in this 

15 "anticodon" of the tRNA. Another segment of this tRNA 

adapter molecule serves to position an amino acid at the 
correct site in the ribosome to serve as a substrate for the 
"elongation" reaction, whereby the nascent chain is 
polymerized to the aminoacyl moiety on the tKNA. The 5*^ 

20 terminal codon codes for the amino terminal amino acid, and 
successive codons direct the successive carboxy addition of 
the next amino acid in the nascent chain. Thus, the 
polypeptide chain is synthesized beginning at the amino 
terminus, with each sixbsequent amino acid added at the 

25 carboxy teinainus. 

Normally, the tRNA is enzymatically "charged" with 
the correct amino acid moiety with extremely high fidelity so 
that the adaptor molecule has the correct amino acid attached 
which properly corresponds to the anticodon. If a tRNA is 

30 "mischarged" , that tRNA will properly recognize the anticodon 
and the properly positioned; but improperly acylated amino 
acid moiety will, nevertheless, be polymerized into the 
nascent polypeptide chain. Furthermore, this can be extended 
to a tRNA which has a modified anticodon matching a 

35 termination codon. These are loiown as "suppressor" tRNA's, 
because they suppress the effect of in-frame chain 
termination codons which may have been introduced into a 
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message. This phenomenon is, in pari:, a fundamental basis of 
this invention. 

This invention uses processes and molecules which, 
in many cases, have not been unicjuely defined chemically, and 
5 uses general terms which do not necessarily match the uses 
precisely the same by some in the field. The following 
definitions are primarily based on functionalities. Much of 
the state of the art and concepts utilized here are contained 
in Watson et al., (1987) Molecular Biolocrv of the Gene. Vols. 
10 1 and 2, hereafter referred to as Watson et al.. Gene ,, 
specifically herein incorporated by reference. 

The term "reading" is the process by which the 
translation system recognizes a given codon of the message 
and polymerizes, at the direction of that codon, a particular 
15 amino acid into the corresponding position of the nascent 
polypeptide chain. 

The term "misreading" is used to refer to 
mistranslation relative to the code of correspondence between 
the codon and the amino acid inserted into the nascent 
20 polypeptide chain synthesized by the original translation 

system (i.e., before the selected aminoacyl tRNA is otherwise 
incorporated into the translation system) • 

The term "termination codon" refers to the codons 
normally used to signal translation termination in the 
25 translation system of use. Where a natural source for the 
translation system is used, these will typically be the 
codons utilized in the "universal code". Normally these are 
the codons UGA, UAG and DAA. However, it is possible to 
generate translation systems with an entirely different 
30 correspondence of codon with amino acid, and so the term is 

also extended to include whatever codon is used in the system 
being utilized. 

The term "tRNA analogue" refers to any molecule 
which is an analogue of a tRNA with respect to the activities 
35 of nascent peptide chain translocation and codon recognition. 
Each particular tRNA species is not a definitive homogeneous 
chemical entity, since niimerous methylation or other 
modifications or changes in the primary nucleic acid sequence 
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may be made whicli may have minor or no effect on its 
essential properties. The term is here broadened beyond its 
use to indicate a nebulously defined core chemical entity, 
including various modified forms thereof. Generally, all 
5 tRNA molecules which function to recognize a specific codon 
and are transcribed directly from a single gene will be 
considered collectively as "a tRNA". Herein, the functional 
definition is more relevant than a chemical description. 

miile many tRNAs from various sources have been 

10 defined in a general sense according to the "core" nucleotide 
backbone sequence (Sprinzel et al., (1987) Nucleic Acid 
Research 15 2R53 t GenBanlcVEMBL DataBank) , the number and 
sites of methylations and other modifications may be 
heterogeneous or imprecisely defined. This definition is 

15 specifically intended to include each variant of a 

heterogeneous or homogeneous category of molecules containing 
minor modifications of a known tRNA including, but not 
limited to^ differences in the methylation or other 
modification patterns^ differences in the nucleic acid 

20 sequence of the tRNA backbone (including sxabstitutions , 
additions, deletions^ and modified bases), tRNAs from 
exogenous sources r and other molecules which may have 
relevant functions common to tRNA molecules. The efficiency 
of tiie interactions between the translational components 

25 (e.g.,. ribosome, elongation factors/ tRNA's) need not be 

especially high, but a person of ordinary skill in the art 
will be able to recognize the essential functions relevant to 
the invention in each of its various embodiments. These 
minimal tRNA functionalities require that the molecule may be 

30 acylated by some process, enzymatic or chemical, and that the 
acylated molecule have. adapter molecule activity. The 
kinetics of interactions are not especially critical but may 
be import2mt in terms of efficiency. 

The term "aminoacyl tRNA analogue" refers to any 

35 analogue of an aminoacyl tRNA molecule which: 

a) functions as an adapter molecule, in such a 
manner that it will appropriately interact with a messenger 
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RNA, a ribosome, associated -translation and elongation 
factors and the nascent polypeptide chain; and 
b) which contains: 

1) a functional anticodon entity and 
5 ii) an amino acid analogue moiety that can be 

polymerized into the nascent polypeptide 
chain. 

It should also be noted that in some procedures 
based on the present invention, it may be sufficient that the 
10 amino acid analogue be the terminal amino acid and, thus, 

polymerization into the nascent polypeptide molecule need not 
necessarily imply that the amino acid moiety itself be 
capable of accepting further amino acid monomers (i.e., 
further peptide elongation) . 
15 Chemically, an aminoacyl tRNA may be defined as a 

molecule comprising: 

X - A - y - B - M; 
where X is the 5' segment of a tRNA, consisting of 
nucleotides and modified nucleotides (methylations and other 
20 modifications on the base components) making up the '*D loop" 
and part (to the anticodon entity) of the "anticodon loop"; 

A is the anticodon segment of the tRNA; narrowly 
defined as the 3 nucleotides which match with the codon to be 
translated, but may be extended to include adjacent 
25 nucleotides within 3 nucleotides of the anticodon; 

Y is the 3 * segment of a tRNA, consisting of 
nucleotides and modified nucleotides (methylations and 
modifications on the base components) making up part of the 
"anticodon loop" and the "variable" and "TOC" loops and 
30 acceptor stem; 

B is the 5'-pCpCpA-3' terminus of the tRNA, 
normally not coded by the tRNA gene and added on by the 3 ■ 
tRNA nucleotidyltransferase; and 

M is the amino acid moiety. 
35 Note that this definition is not meant to exclude 

the possibility of aminoacylating a shortened tRNA molecule 
with 3» terminal nucleotides removed, such as tRNA(-A) , tRNA 
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(-CA) or tPNA(-CCA} . Xf functional, -they would be equivalent 
to a tPNA as used herein* 

The term "multi-nucleotide" (MNM) refers to a short 
segment of nucleic acid, typically ribonucleic acid. The 
5 term is used in the context of the preparation of an 

aminoacyl tRNA analogue. The corresponding tRNA (i.e., the 
deacylated form) is shortened by removal of a short segment 
(Z) to form the "tENA(-2)" or tRNA molecule with the 
corresponding Z segment removed. Alternatively, a truncated 

10 tPNA(-Z) can. be generated directly by recombinant DNA or 

chemical methods. The Z segment corresponds in some sense, 
to the multi-nucleotide(MNM) . Thus, upon ligation of the 
shortened tRNA (-Z) to the multi-nucleotide molecule (MNM), a 
tRNA results which is equivalent to the deacylated aminoacyl 

15 tRNA analogue. One method of the invention uses 

aminoacylated-MNM's as substrates for ligation of tRNA(-Z) 
molecules to form aminoacyl tRNA analogues. 

Thus,, the term "tRNA{-Z)" refers to that molecule 
which is ligated to the aminoacyl-multi-nucleotide to produce 

20 a functional aminoacyl tRNA analogue molecule. It is 

particularly intended to include the component which is a 
shortened form of a tRNA, typically with a few of the 3 • 
terminal nucleotides removed. The ligation of the tRNA(-Z) 
to the aminoacyl -multi-nucleotide (aminoacyl-MNM) generates a 

25 molecule which will become (or is) a functional aminoacyl 
tRNA analogue. Typically Z and MNM are equivalent and, in 
the preferred embodiment, will be the 5 • -pCpA-3 • 
dinucleotide . 

The term "uxmatural amino acid analogue" refers to 

30 a molecule that is either directly an analogue or 

modification of an amino aeid. It would include modified 
natural amino acids, unnatural amino acids, analogues of 
amino acids and derivatives of amino acids. 

The set of natural amino acids would include those 

35 amino acids that are commonly used in the polymerization 

process performed by ribosomes. Normally, these amino acids 
have codons which operate to signal for polymerization into 
protein. Although unusual amino acids exist naturally in 
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proteins, they are usually relatively simple laodif ications of 
members of the group of twenty common amino acids. Also 
included would be amino acids which actually do occur in 
nature, but are not polymerized in their final form during 
5 the polymerization (or translation) process. These natural 
modifications apparently result from post-polymerization 
modification of the amino acid that is performed either in 
the nascent chain stage, or more probably, upon completion of 
the polypeptide chain. These include the post- 
10 translationally modified amino acids such as 4- 

hydroxyproline, 5 -hydroxy lysine, cystine and others. These 
modifications are made enzymatically and are highly 
restricted both in the type of modification made and the 
amino acid which is targeted for modification. 
15 Modified natural amino acids, unnatural amino 

acids, analogues of amino acids and derivatives of amino 
acids are intended to include all functional modifications of 
analogues of amino acids, both alpha and otherwise. This 
would also include modifications which involve substitution 
20 or addition of unusual atoms, addition of side groups 

including cof actors or their binding sites, glycosylations 
and acetylations. 

The term "preselected codon" refers to a codon 
which is intended to be changed and will, in some functional 
25 form, be within the reading frame of the protein to be 

produced. Thus, if a particular sequence has more than one 
reading frame, the codon need only be a change intended to 
affect one of them. 

The term "site specific incorporation" refers to 
3 0 the introduction into known sites of either a particular 

codon into a specific site in the reading frsune of a message, 
or of a particular amino acid analogue into a specific site 
in a polypeptide chain. It will be recognized that since 
there is a one to one positional correspondence between codon 
35 positions and their integrated amino acid sites, the site of 
an amino acid analogue substitution is determined by its 
corresponding codon position. Consequently, site specificity 
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of amino acids may be derived from determination of ^e codon 
site, Gind vice versa. . 

The nascent, polypeptide chain Is the incompleted 
polypeptide chain resulting from the translation of the mRNA 
5 which is 5« proximate to the current codon. The current 
codon "directs" the specificity of the next amino acid 
analogue which is to be polymerized in the nascent chain. In 
the normal elongation process, the nascent polypeptide chain 
is polymerized onto the amlnoacyl moiety atteched to the tENA 
10 which recognizes the codon adjacent to the A site of the 
ribosome. 

Use of the terms protein and polypeptide is 
Intended to Include the products of the system that are 
modified mollacules substantially equivalent to a protein or 
15 polypeptide. This is meant specifically to include a protein 
or polypeptide, as well as both its apoprotein and 
holoprotein forms. Included In the definition are 
polypeptide molecules: 

a) having a modified amino acid substituted at 
20 the normal site of an amino acid (equivalent to a modified 

amino acid) ; or 

b) having an amino acid like moiety which may 
differ in structure or composition, including, but not 
limited to: 

25 1) a moiety which would have the peptidyl 

linkage involving an amino group off a beta, gamma, delta or 
-Other carbon atom (I.e., non-alpha amino acid); 

ii) a moiety which might have an atom other 
than a carbon or nitrogen atom along the polypeptide 
30 backbone; 

ill) an amino acid containing a side chain R 
which may correspond to a synthetic side chain (Including 
heteroatoms, cyclic or acyclic groups or metal binding 
groups; also radioisotopic or Isotoplc substitutions) ; 
35 iv) euaino acids in which the carboxylate is 

replaced by other groups such as sulfonyl, phosphoryl, 
phosphonyl and the like; and 
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V) amino acids with restricted xp, <p angles, 
such as proline analogues or di-alpha substituted amino 
acids. 

These methods should be applicable to essentially 
5 any protein, particularly enzymes displaying catalytic^ 
receptor or ligand binding functions, structural or mixed 
functionalities. Among the catalytic proteins are included, 
but not limited to peptidases, nucleases, glycosidases, mono- 
and dloxygenases , pyridoxalphosphate and flavin dependent 
10 enzymes, lipases and aldolases. Among the receptor proteins 
are included, but not limited to, antibodies, T-cell 
receptors, muscarinic receptors, G-proteins, lectins, DNA 
binding proteins and cytochromes. Among the structural 
proteins are included, but not limited to myosin and silk. 
15 The term "substrate binding site" refers to those 

the portions of the polypeptide chain whose amino acids are 
located near to (or are important in conferring) the native 
three-dimensional spatial conformation of the protein 
important In substrate binding, or those amino acids situated 
20 nearby in space to the region where a substrate or ligand is 
bound to the polypeptide backbone. 

The term "enzymatic active site" refers to those 
amino acid residues which are situated near to or are 
involved in conferring the essential spatial or chemical 
25 properties necessary for an enzyme to catalyze a reaction. 

The term "protein-protein interface" refers to the 
residues nearby the region where distinct polypeptide chains 
interact. 

The term "cofactor binding site" refers to those 
30 residues involved in, or nearby the site where a cofactor or 
ligand becomes attached or are Involved in the recognition 
for where such might be attached. 

It will be recognized that there are a multitude of 
ways to generate nucleic acid sequences containing the 
35 selected codon at the specific site and which will translate 
to create the polypeptide sequence of interest. These 
methods include but are not limited to use of natural 
sequences, modifications of natural sequences, partially or 
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Wholly syntlietlc sequences and coinbinat:ions of various 
na-bural sequences to create hybrid new prot:eins. See, 
Maniatis; Wu and Grossman, Me-thods in Enzvmoloay , Vol, 153, 
and Ausubel et al., (1987) Curren"b Protiocols in Molecular 
5 Biology, Vols. 1 and 2, each of which is hereby specifically 
incorporated by reference. 

Such seG[uences would include both DNA forms and RNA 
forms, DNA forms would include, but are not limited to, 
sequences integrated into a genome, sequences integrated into 

10 extrachromosomal elements (including plasmids, episomes and 
minichromosomes or other free DNAs) , phages, viruses, and 
other similar forms. Simxlar ENA forms are also included. 

The sequence of the translated HNA may be changed 
by substituting different redundant codons at various sites. 

15 It is not well understood why one ^specific codon is used 

instead of another redundant one, arid each redundant codon 
might be replaced with one of them. In theory, in the 
absence of '^wobble", one could generate a translation system 
which would utilize as many as 63 different amino acids plus 

20 one terxoination codon (see, Watson et al. , Gene ) . By 
application of these techniques, one could generate a 
translation system with a genetic code quite different from 
the "universal code".. In particular, the starting sequence 
for th^ desired product may be natural, a modified sequence 

25 or a totally synthetic one. 

The site of the substitution may be changed to any 
codon which is intended to be generally "dedicated" to 
insertion of the specified preselected amino acid. In the 
preferred form of use, one would select a codon which does 

30 not participate in a "wobble" type redundancy and thus be 
subject to being translated by an aminoacyl tRNA different 
from the one selected to perform the misreading. While a 
truly unicpie correspondence between the codon and aminoacyl 
tRNA would provide virtually stoichiometric substitution, 

35 this would be diluted through any mechanism which woxild allow 
another aminoacyl tRNA to function as the adapter molecule 
recognizing the selected mistranslation codon. Thus, 
although any codon could theoretically be chosen to code for 
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the mistranslation^ one would normally select a codon which 
is not utilized in the reading frame anywhere else in the 
polypeptide, and would not be translated with any existing 
aminoacyl tRNA contained in the ultimate translation system 
5 to be used. 

For these reasons, the termination codons are well 
suited because there will not be o1:her in frame termination 
codons in the sequence. Optimally a termination codon 
different: from that actually used to terminate translation 

10 would be selected. Furthermore, the unique codon selected 
may be unique by virtue of having been made so by gene 
synthesis. Uniqueness would result from substituting all 
other sites containing that codon to different redundant 
codons, thus leaving that particular site as the sole site 

15 containing the selected codon. In addition, this system 

could be easily used to made two or more substitutions, of 
the same predetermined sunino acid analogue, or of two or more 
different analogues, by virtue of selecting multiple unique 
codons • 

20 The term "substantially homogeneous** relates to the 

concept of homogeneity of modifications with respect to both 
site and type. A particular modification is substantially 
homogeneous when a large majority of the resulting 
translation product is homogeneous, typically greater than 

25 60% are of a single form, preferably greater than 80% 

identical, and optimally virtually all, greater than 98%, are 
identical • 

The term "siibstantially stoichiometric" refers to 
the property that most of the products are siibstituted at an 
30 intended site, typically more than about 70 to 80% of the 
products are substituted, preferably more than 90% are 
substituted, and optimally virtually all, more than 98%, are 
substituted. 

A ■•protein synthesizing system** is a system which 
35 comprises ribosomes, tRNAs, elongation factors and all of the 
other components necessary to translate a mRNA into protein 
upon providing the mRNA and appropriate conditions. It is 
also referred to as a ••translation system". Typically, a 
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cell Iniierently possesses a prot:eln synt:heslzlng sys-kem, but 
which may have a low level of activi-ty for various reasons. 
While in vivo systems may be utilized, for the uses described 
herein^, difficulties associated with the introduction of 
5 necessary aminoacylated tRNA analogues may exist. This may 
be achieved by standard microinjection procedtires or by any 
other mechanism of introducing externally produced molecules 
into the cell, such as electr operation of spheroplasts . An 
obvious possible technique is either cell or liposome 

10 . fusions^ using such procedures as polyethylene glycol or 
Sendai viral f us ions . 

One preferred translation system is the frog oocyte 
with microinjectibn,^ which will also find use for translation 
systems in other large cells* More typically, an in vitro 

15 system is preferred because it is easier to introduce a 
higher concentration of charged unnatural aminoacyl tRNA 
molecules to the system. Such systems are available 
commercially and have been derived from ly sates of cells from 
E , coli , S. cerevislae . wheat germ and rabbit reticulocytes. 

20 Inherent in the procedure is the capability for 

using the same single message to direct different translation 
systems which incorporate distinct unusual aminoacyl tHNAs. 
Different translation systems may be utilized to incorporate 
a different tinusual amino acid into the selected site, thus 

25 generating a series of products, each differing by the 

insertion of the appropriate preselected unusual amino acid 
at the selected site. This also makes a termination codon 
the preferred selected codon as it would not otherwise have a 
corxesponding aminoacyl tRNA. The proper translation systems 

30 can be made without the need to remove a preexisting tRNA, 
but merely by the addition of a new one. 

Synthesis of a ftmctional unusual aminoacyl tRNA 
Involves a complicated process of: 

a) selection of the correct anticodon to use; and 

35 h) attachment of the appropriate predetermined 

amino acid analogue. 

Once the mistranslation codon is selected, a tRNA 
with the corresponding anticodon must either be selected or 
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manufactured. Selection is preferably performed by isolating 
a natural tRNA. Manufacture may be by mutation and 
selection, or by site specifically introducing the 
appropriate anticodon* 
5 The modified translation system normally will not 

utilize the enzymatic acylation of the unusual adapter 
molecule, thus the functional definition of tRNA need not 
normally include the enzymatic acylation function. This, 
however, does not preclude the use of enzymatic acylation 
10 where possible, in which case an acylation capability would 
be important. 

The attachment of the amino acid to a tRNA is 
naturally catalyzed by the aminoacyl tRNA synthetases. These 
enzymes are reversible and are highly specific both for the 
15 appropriate tRNA (though acylatipn specificity does not use 
the anticodon for recognition) and for the amino acid to be 
attached^ Although it may occasionally be possible to use 
the natural synthetases, or perhaps to modify their 
specificity, such will be tmusual. Thus, the synthesis of 
20 the appropriate aminoacyl tRNA is very important. And 

because the reaction is reversible, it is important that the 
unusual aminoacyl tRNA not be subject to deacylation 
activity, either due to the aminoacyl tRNA^s inherent 
inability to act as a siibstrate for any synthetase present, 
25 or by elimination of the deacylating synthetase by some 

method ( inactivation by mutation or chemical modification, 
through antibody removal or some other means) • 

The aminoacyl tRNA is of central importance to this 
invention and the synthesis of the molecule is a major 
30 aspect. Where no synthetase exists for an xmusual amino 
acid, some means must be devised to make the adapter 
molecule. 

Infrequently, the unusual amino acid might be a 
siibstrate for a synthetase and be charged onto an appropriate 
35 tRNA. In some few other cases, it might be possible to 

modify the amino acid portion of an aminoacyl tRNA without 
destroying its adapter function. This will seldom be 
feasible because the occurrence of reactive groups on the 
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nucleic acid portion will normally compete and eliminate the 
specificity of the modification reactions* 

An alternative method for the synthesis of an 
unusual aminoacyl tRNA is to synthesize an aminoacyl 
5 nucleotide, and then to ligate this moiety onto the 

appropriate tRNAC-Z) molecule. This method is generally 
applicable for virtually any aminoacyl tRNA molecule, 
including attaching normal amino acids,, though much less 
efficient than the synthetase reactions. The only restraints 

10 are that the unusual amino acid not interfere with the 

acylation oir deprotection steps and that it not interfere 
with the ligation step. Xf so, there is likely to be 
alternative chemistry to synthesize the adapter molecule. 
The general scheme is to attach the unusual amino acid onto 

15 an oligonucleotide and then to ligate together the nucleotide 
portions, preferably with T4 RNA ligase. A dinucleotide is 
preferred because it minimizes interference in the chemistry 
linking the amino acid to the nucleotides and provides a 
higher efficiency of ligation than a single nucleotide or 

20 AppA analogue. Normally the 3* terminal nucleotides on a 
tRNA are 5 • -pCpCpA-3 ' , so the dinucleotide of choice is 5'- 
pCpA-3 • . - It would likely be possible to use other 
nucleotides (either di- or oligo) such as deoxy-C-ribo-A 
(i.e., pdCpA) or an entire deoxy-RNA (i*e.^ DNA) with a 3» 

25 terminal ribo-A. After attachment of the unusual amino acid 
to the nucleotides, preferably using a dinucleotide, the 
tRNA(-Z) is ligated to the aminoacyl-multi-nucleotide (aa~ 
MNM) to generate the final aminoacyl tRNA analogue. 

The ligation step is performed by chemistry or by 

3 0 enzymatic means, the enzyme may be any which has ligation 

activity on single stranded RNA molecules. The dinucleotide 
is a sufficiently long substrate for the T4 RNA ligase used 
in the examples, other enzymes might require a longer or 
shorter substrate. It will also be observed that the 

35 deprotection reactions might, in some cases, be performed 
after the ligation step. 

The source of the tRNA(-Z) component may come from 
processed natural tRNAs. One source is gene synthesis of a 
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tRNA p u A' Change of the anticodon of natural 

tBNA^ ^ to a termination suppressor destroys recognition of 
the tRNA by the aninoacyl tRNA synthetases. 

Another alternative includes making suppressor 
5 tRNA(-Z) •s by "runoff transcription", which will not be 
modified as normal tRNAs, but having some substantial 
fraction of activity in tramslation. Since each tRNA 
molecule will typically be acylated chemically only once (as 
opposed to the normal enzymatic reaction) , it is generally 
10 preferable to create an aminoacyl tRNA that may function 

somewhat less efficiently in the elongation reaction, if very 
large quantities of the appropriate tRNA(-Z) can be made for 
convenient performance of the acylation chemistry. Use of 
specially designed systems for transcribing the appropriate 
15 tRNA acceptor molecules at high etfficiency, but not modified, 
may be very important and are included as possible sources of 
these molecules. Thus, unmodified tRNA molecules are 
included in the specifications even though not included in 
the normal definition of tRNAs. 
20 The chemical procedure of making the aminoacyl 

tRNAs may be easily modified from the described method. The 
most obvious is to use a slightly modified tRNA(-Z) , which 
may be slightly longer or shorter or modified from the 
starting molecule. These molecules are hereby included 
25 expressly in the specifications. Toother obvious 

modification is to use, instead of a dinucleotide, a 
mononucleotide, trinucleotide, or other oligonucleotide. 
These are also included in the specifications, all included 
in the multi-nucleotide (MNM) molecule definition. Use of a 
30 different nucleotide combination from 5*-pCpA-3» is a likely 
possibility. 

There are at least three acylation routes which may 
be used to synthesize the aminoacyl-dinucleotides (see Figure 
1) . Generally, they include blocking particularly reactive 
35 groups on the dinucleotides , attachment of the amino acid to 
the 3 ' terminal ribose ring and then removal of the blocking 
groups . 
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The flrs'b route involves -trealiment: of tJie 
dinucleotides with nitrophenylsulf enyl chloride (NPS-Cl) to 
block the cytidine base group. Reaction with an aminoacyl- 
NPS in l^l'-carbonyldiimidazole (CDI) will attach the 
5 derivatized amino acid to hydroxy 1 groups on the 3* 

nucleotide ribose ring. Treatment with thiosulfate will 
remove the NPS from the cytidine leaving the aminoacyl- 
dinucleotide • 

A second route involves direct synthesis of a 

10 dinucleotide or treatment of the dinucleotide with 9- 

fluorenylmethyloxycarbonyl chloride (FMOCCl) ^-cyanoethyl 
chloride (EtCNOCl) and tetrahydropyranyl chloride (THPCl) , 
which will block the phosphdryl groups, nucleotide base and 
ribose 2» hydroxyl groups. Reaction with aminoacyl - 2-(4- 

15 biphenyl) isopropyloxycarboxyl in CDI will attach to the 3' 

hydroxyl group. Treatment with l,i;3,3 tetramethylguanidine, 
2-pyridinealdoxime and formic acid will remove all the 
blocking groups to yield the aminoacyl-dinucleotide. An 
aminoacyl - vinyloxycarbonyl (aa-VOC) may be substituted for 

20 the aminoacyl*-BPOC« 

A third metiiod of synthesis involves synthesis of 
or treatment of the dinucleotide with benzyloxycarbonyl (CBZ) 
and tetrahydropyranyl (THP) resulting in blocking of the 
cytidine base and the ribose 2' and 3* hydroxyl groups. 

25 Reaction with aminoacyl - benzyloxycarbonyl in GDI will cause 
attachment to the ribose 2 'OH group. Treatment with 
palladium and BaSO^ in H2 and acid will remove the blocking 
groups to yield the aminoacyl-dinucleotide. It has been 
demonstrated that the carbobenzoxy amino acids can be coupled 

30 to pC^^pA and the NPS and CBZ groups removed in 35% overall 
yield. 

There may also be enzymes with the present 
capability to acylate nucleotides or may soon be engineered 
to be able to acylate the oligonucleotides or tRNA (Thirsod 
35 and Klibanov, (1986) J. Am. Chem. Soc. . 108:5638 & 3977? 

Sweers and Wong, (1986) J. Am. Chem, Soc. . 108:6421; Shaw and 
Klibanov, (1987) Biotech. Bioena, . 29:648; and Wong et al. , 
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Fluorocarbohvdr ates: Chemistry & Biochemistiirv , Taylor (Ed.)/ 
ACS Symposium Series , ACS Washington, D.C.)^ 

The selection of a predetermined amino acid 
analogue to incorporate into the polypeptide will be driven 
5 by the needs of the user. Some may desire to substitute any 
of a number of specific modified amino acids into the site, 
for any of a number of different purposes. In particular, 
those of most interest will be those residues which may 
modify specificity, activity or structure of the protein to 
10 satisfy new requirements. Part of the power of this 

technique is the important potential to break out of the 
previous limitation of choices among only the natural amino 
acids. The present invention allows substitution of 
virtually any L-amino acid, natural or unnatural, as well as 
15 D-amino acids. 

Among various uses for the proteins of the present 
invention are the introduction of particular types of 
residues including, but not limited to, incorporation of: 

a) heavy metal atoms (useful in crystallography) ; 
20 b) cross linking agents; 

c) markers (such as radioactive, spectroscopic, 
fluorescent, magnetic, and electronic) ; 

d) electron acceptors or donors; 

e) metal chelators; 

25 f) structurally restricting residues; and 

g) residues with novel nucleophilicites; 

h) residues with altered acidities and basicites; 

i) residues with altered geometries (such as 
homoserine, homocysteine or ornithine) ; and 

3 0 j) residues with altered hydrogen bonding 

properties (e.g., amidine vs. amide). 

The physical or biochemical properties which can be 
studied by making substitutions at known and specific sites 
are many. Spectroscopic markers may be introduced to 

35 particular regions in the tertiary structure of a protein or 
complex of polypeptides. Residues may be introduced with a 
different pKa, or which will affect the pKa of nearby 
residues, with a different nucleophillicity , or which will 
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affect the nucleophilliclty of nearby residues, with electron 
acceptor function, with metaJ. chelator function, with 
modified hydrogen bond donor or acceptor function, with 
altered or restricted bond torsion angles, with cof actor 
5 . binding capability or with special markers for fluorescence 
or other detection or purification methods. The invention 
provides the opportunity to introduce into the polypeptide 
chemical groups which are beyond the range of the natural 
. amino acid residues, and to escape from many of the 

10 constraints previously imposed by nature. 

In particular, one of the most important uses will 
be the incorporation of heavy metal scattering centers into 
identical locations in a polypeptide, which will, upon 
crystallization, allow for relative ease in solving of the 

15 wave equations necessary to determine the gross three— 

. dimensional structure of a polypeptide chain. The stinicture 
of a protein is very important, and is normally the essential 
property of an enzyme which confers on it the ability to 
perform its function. These functions will include aspects 

20 of the properties of mechanism of catalysis, specificity of 
substrate binding and reaction, structural features and 
regulatory interactions • 

Although meuiy of the techniques may be most 
suitable for evaluating the physical or biochemical 

25 properties within some short distance away from the portions 
of the polypeptide chain localized nearby in space to 
substrate binding sites, enzymatic active sites, ligand 
binding sites, protein-protein interfaces and cof actor 
binding sites. Nearby, as used herein, means within about 50 

SO to 150 angstroms from the site of interest, preferably less 

than about 30 emgstroms, emd optimally within 15 angstroms in 
three dimensional space. 

In vitro translation systems will normally be used 
herein, though in some cases it may be possible to use in 

35 vivo systems » Typical in vitro translation systems are 

procaryote sources including coli, and eucaryote sources 
including rabbit reticulocyte lysates, wheat germ lysates and 
yeast lysates and heterogenous mixed systems containing 
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components from various sources. ( See , Wu and Grossman, 
Methods in Enz vmoloay , Vol. 153) • Preferred translation 
systems include modified transcription and translation 
systems exhibiting greatly increased transcription by placing 
5 the gene of Interest under control of an operably linked 
strong promoter. Alternatively, incorporation of a very 
active RNA polymerase would Increase the message level. 
Background products might be lowered by using a rifampicin 
insensitive T7 RNA polymerase to transcribe the message 
10 opereOjly linked to a T7 specific promoter, while other 
endogenous transcripts are repressed by the presence of 
rifampicin. A continuous flow translation system is a 
possible improvement for large scale production purposes 
(Spirin et al., (1988) Science 242:1162). Intracellular 
15 injection into frog oocytes, muscle cells or other large 

cells might be performed to introduce the message, tRNA or 
aminoacyl tRNA into the cells. 

In certain cases, one particular translation system 
source would be preferred. The yield of desired product or 
20 further processing may be dependent upon the presence or 

absence of activities in the translation systems. Such might 
include glycosylation, acetylation or other processing 
enzymes, or lack of proteases or other enzymes. 

Among the important inherent advantages of this 
25 method of analysis is the ability to make numerous 

substitutions at each selected site. Upon construction of a 
single mRNA having a mistranslation codon, a virtually 
limitless number of different substitutions may be made at 
that site limited only by the number of unnatural aminoacyl 
30 tRNA molecules desired to be utilized in the translation and 
mistranslation process. The selection of a termination codon 
is especially useful because no endogenous tRNA need be 
removed or inactivated. Inherently, no natural tRNA would be 
present, and the new unnatural amino acid analogue is 
35 incorporated merely by addition of the new aminoacyl tRNA 
analogue. With selection of the appropriate tRNA from a 
catalogue of unnatural aminoacyl tRNAs, any unnatural amino 
acid analogue could be siibstituted at the position selected. 
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Among Uie unnatural amino acids vliich may be 
selected are modifications of the natural amino acids, 
modifications of amino acids other than the natural ones, 
amino acids other than alpha^amino acids (i.e. beta, gamma, 
5 etc.), amino acids having a different stereospecif icity (i.e. 
amino acids, or having a different stereospecif icity at 
other asymmetric carbon or other atoms) , amino acids having 
substituted atoms or containing unusual elements and residues 
containing cof actor binding sites. 

10 A coupled transcription and translation system is 

one in which the products of the transcription system are 
directly translated by the system without purification or 
isolation of the mENA produced. The system is initially run 
under conditions which are optimum for transcriptional 

15 activity after which the conditions are optimized for 
translation of the transcripts produced. 

It is possible to specifically design a translation 
system which provides the essential featiires of this system, 
while using a translation system of significantly modified 

20 code. For this reason, a translation system having a totally 
different correspondence between codon and amino acid is also 
included. 

The protein products of this method may have a 
variety of properties, such as a) homogeneity of site and 

25 type of modifications in the proteins; b) stoichmetric 
modification (all of the subject proteins are modified, 
without dilution by unmodified forms; and c) known 
characterization for type and position of the modification. 

The large number of potential uses of such products 

30 should be immediately recognized by any protein biochemist. 

Any means of characterizing a protein should be sin^lif ied by 
a lowered background or noise from tinmodif led forms or 
heterogeneously modified foinns. The means for physical or 
biochemical analysis is as broad as the techniques available 

35 and applied to purified proteins, see > for excu&ple. Methods 
in Enzvmoloav. Vols. 1-187; I,ehninger, Biochemistrv z Stryer 
Biochemistry r and Creighton, The Proteins . 
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For example, in applying the technique of 
characterization by X-ray crystallography, the introduction 
of a heavy metal scattering center in a unic[ue and uniform 
site in a protein will greatly assist in the analysis of the 
5 wave pattern data to solve the wave equations necessary to 
determine the three dimensional protein crystal structure 
(Mathews, (1976) Annual Review of Physical Chemistry, 27:493- 
523). 

In spectroscopy applications, introduction of 
10 selected particular physical markers at rniique and uniform 
sites in the protein should be extremely useful in 
characterizing the static and dynamic fine chemical and 
electronic structure of binding sites, active sites and other 
important portions of a protein 
15 The following experimental section is offered by 

way of example and not by limitation. 
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EXPERIMENTAL 
Mutagenesis and In Vifcro Pro-bein Svntihesis 

The mu'tagenesis methodology desciribed above has 
been, developed with the well-characterized hydrolytic enzyme, 
5 RTEM )3-lactamase (Hamilton-Miller and Smith Eds. , (1979) 
"Beta-Lactamases",^ Academic Press, London ; Jack and Sykes, 
(1971) Ann, N,Y> Acad, Sci> 182:243; and Abraham, (1977) 
Antibiot, 30:51; Fisher et al., (1978) Biochemistry 17:2180). 
This bacterial (£^ coli ) enzyme is a single chain 29kD 

10 protein containing one disulfide bond (Sutcliffe, (1978) 

Proc. Natl, Aqad, Sci USA 75:3737; Ambler and Scott, (1978) 
Proc. N& tl. Acad, Sci> TJSA 75:3732; and Pollitt and Zalkin, 
(1983) J. Bacterid . 153:27). The gene encoding )9 -lactamase 
has been sequenced (Sutcliffe, (1978) Proc. Natl. Acad. Sci 

15 ITSA 75:3737; Ambler and Scott, fl978) Proc. Natl. Acad. Sci^ 
USA 75:3732; and Pollitt and Zalkin;. (1983) J. Bacterid. 
153:27),^ the three dimensional structure of a homologous 
Class A )9--lactamase has been solved (Herzberg and Moult, 
(1987) Science 236:694) and a simple spectrophotometric assay 

20 exists for enzyme activity (One nitrocefin hydrolysis unit 

(iMmole nitrocefin hydrolyzed/min/mL, 0,1 mM nitrocefin, 50mM 
phosphate buffer, pK 7) corresponds to 0.^61 fxg enzyme, as 
determined by Bradford assay; 0*Callaghan et al,, (1972) 
Antimic r Ob . Act , Chemother, 1:283). The enzyme inactivates 0- 

25 lactam antibiotics (penicillins and cephalosporins) by 

hydro ly zing the /3- lactam cunide bond. The reaction proceeds 
via a two step mechanism involving nucleophilic attack of 
Ser70 to form an acyl-enzyme intermediate,, which is then 
hydrolyzed to yield the corresponding acid and free enzyme 

30 (Fisher et al,^ (1980) Biochemistry 19:2895; Knowles, (1985) 
Acc, Chem, Res, 18:97; Dalbadie-McFarland et al., (1982) 
Proc. Natl. Acad, Sci, USA 79:6409; and Sigal et al., (1984) 
J, Biol. Chem, 259:5327). 

Phe66, which is conserved in 4 Class A /5-lactamases 

35 (AnflDler, (1979) "Beta-Lactamases"^ Hamilton-Miller and Smith 
Eds., Academic Press, New York pp. 99-125), was chosen as the 
first target for mutagenesis since a nximber of L- 
phenylalanine analogues are easily synthesized and 



wo 90/05785 



35 



PCr/US89/05256 



phenylalanine does not require additional side chain 
protection in the chemical aminoacylation step. A 2.5 A 
crystal structure of the aureus enzyme (33% homology with 
the E_i. coli enzyme) localizes the residue to an extended loop 
5 between a buried ^-sheet and an a-helical domain containing 
the active site (Herzberg and Moult, (1987) Science 236:694). 
The structural importance of this residue was confirmed by 
constructing the Phe66 Ala (pF66A) and Phe66 -* Tyr (pF66y) 
mutants (Fig. 3) , both of which yielded little activity in 

10 crude cell extracts. (All Jji vivo work was carried out using 
E. coli strain JMlOl (^acpro thi, supE, F*traD36^ proAB, 
lacl^ zmiS) . [C. Janisch-Perron et al., (1983) Gene 22:103]. 
All in vitro work was done using S-30 extracts [Pratt, (1984) 
"Transcription and Translation", Hans and Higgins, (Eds.), 

15 IRL Press, Oxford] prepared from E^. coli strain DIO (ma-20, 
irelAl, spoTl^ metBl) [Gesteland (1966) J. Mol. Biol. 16:67]). 
Attempts at purification resulted in loss of all activity for 
the F66A mutant, while t;he F66y mutant was purified in low 
yield and characterized. The Kj^ of the F66Y mutant for 

20 nitrocefin was identical to that of the wild-type enzyme, 
whereas the k^^^ was 16% that of wild-type enzyme (results 
not shown) • 

In vivo and in vitro synthesis of ^-lactamase was 
carried out using the plasmid pSG7 (Fig. 3) , which was 

25 designed with the following considerations: RTEM /9-lactamase 
is synthesized in vivo with a 2 3 -amino acid leader sequence 
that is clipped off during tramslocation across the inner 
membrane to yield fully active enzyme. Xn order to express 
active enzyme jji vitro . we used a truncated gene for 

30 lactamase (Kadonaga et al., (1984) J. Biol. Chem. 259:2149) 
in which a 63 -bp deletion corresponding to a 21-2aQino acid 
deletion in the leader sequence is sufficient for direct 
expression of active enzyme. The truncated gene was placed 
under the transcriptional control of the strong hybrid tac 

35 promoter (Amann et al., (1983) Gene 25:167), as it has been 
demonstrated that the amount of protein synthesized in an in 
vitro translation system is proportional to the amount of 
mRNA added (Reiness and Zubay, (1973) Biochem . Biophvs . Res . 
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Cgsos^ 53:967) • To t:hls end, the tnincaiied gene was also 
placed under control of tlie # promoter (Tabor and Richardson, 
(1985) Proc, Natl, Acad, Sci, USA 82:1974) from bacteriophage 
T7 with the intent of supplementing the reaction with T7 RNA 
5 polymerase, which synthesizes RNA at a rate 10 times that of 
the col i polymerase (Chamberlin and Ryan, (1982) The 
Enzymes 15:87) . The kanamycin resistance gene from Tn903 
(Oka et al., (1981) J. Mol. Biol, 147:217), cloned in the 
opposite orientation from the tac and T7 promoters, provides 

lb a selectable marker for these plasmids. 

Protein synthesis was carried out in vitro in order 
to simplify addition of the aminoacylated suppressor tRNA to 
the translational machinery. The coupled Ej. coli system 
developed by Zubay (Zubay^ (1973) Annu. Rev. Gen- 7:267), 

15 with some iapdif ications by Collins (Collins, (1979) Gene 
6:29) and Pratt (Pratt, (1984) "Transcription and 
Translation" Hanes and Higgins (Eds.), IRS Press, Oxford, 
pp. 179-209), was xised with little further modification except 
for lowering the pH of the system from 8.2 to 7.4 in order to 

20 better stabilize the base-labile acyl linkage of the added 
aminoacylated s;uppressor (Pig. 4) . 

Yields of active i9-lactamase synthesized in this 
system primed with pSG7 typically ranged from 30-45 iiq/viL of 
reaction mixture, based on the nitrocefin hydrolysis assay 

25 (see Fig. 6). This corresponds to 23-33 copies of active 

enzyme per copy of gene, and represents an 11-fold increase 
in synthesized enzyme over that directed from the wild-type 
^r promoter of the pBR322 derivative pSGl. (The amount of 
overproduction iii vivo , that is, JM101/pSG7 vs. JMlOl/pSGl, 

30 is also 11- fold, based on the specific activity of crude cell 
extracts.) Surprisingly, addition of T7 RNA 
polymerase (to a final concentration of 8500 units/mL) to 
reactions primed with the T7 promoter plasmid pSGl yielded 
levels of . active enzyme that were 65-70% of the levels 

35 produced in reactions primed with pSG7. (In vitro protein 
synthesis of 434 repressor expressed behind the strong tac 
promoter affords greater than 150 copies of protein per copy 
of gene. ) In vitro produced )3-lactamase was purified to 
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homogeneity by ammonium sulfate precipitation followed by 
chromatofocusing and anion exchange chromatography (Fig. 4) . 
Protein was determined to be homogeneous by SDS— 
polyacrylamide gel electrophoresis and had a k^^t % 
5 nitrocefin identical to that of in vivo produced enzyme. All 
suppression work was carried out using the pSG7 derivative 
pF66aiB (Fig. 3), which carries the Phe66 TAG mutation. 

suppressor tRNA Generation Characterization 
10 The suppressor tRNA used to deliver the unicpie 

amino acid to the growing peptide chain on the ribosome must 
meet two criteria: it must efficiently insert the amino acid 
in response to the UAG message and it must be neither 
acylated nor deacylated by any of the coli aminoacyl-tRNA 

15 synthetases present in the in vitro transcription/translation 
system. The first condition is necessary for producing 
quantities of protein that can be purified and further 
studied, the second condition is required to insure that only 
the desired unnatural amino acid and not one or more of the 

20 twenty natural €unino acids in the in vitro reaction will be 

inserted into the protein (Schimmel and Soli (1979) Ann. Rev. 
Blochem. 48:601; Fersht and Kaethner, (1976) Biochemistry 
15:3342; Igoli et al., (1978) Biochemistry 17:3459; Schreier 
and Schimmel (1972) Biochemistrv 11:1582; and Yarus, (1972) 

25 Proc. Natl. Acad> Sci. USA 69:1915) . An amber suppressor 

tRNA derived from yeast tRNA^^® (Bruce and Uhlenbeck (1982) 
Biochemistry 21:3921 and 21:855) was expected to meet these 
requirements based on the following obseirvations : Yeast 
tRNnF^^^^, in which residues 34-37 of yeast tRNA^*^® are 

30 replaced by 5«- CUAA-3 • , is expected to be an efficient 

suppressor based on Yarus' extended anticodon loop hypothesis 
(Yarus, (1982) Science 218:646). In addition, Bruce and 
coworkers (Miller et al., (1977) J. Mol. Bio. 109:275; Bruce 
et al., (1982) Proc. Natl. Acad. Sci. US A 79:7127; Bossi and 

35 Roth, (1980) Nature 286:123; and Steege, (1978) "Biological 
Regulation and Development", Vol. I, Goldberg Ed., Plenum 
Press) demonstrated that yeast tRNA^^^^^®^ was efficient in 
translating UAG codons in a mammalian protein synthesizing 
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system (although being somewhat less efficient in a wheat 

germ system) . Kwok and coworkers (Kwok and Wong, (1980) Can. 

J. Biochem, 58:213) have shown that coli phenylalanyl-tRNA 

synthetase (PRS) aminoacylates yeast tRNA^*^^ < 1% as well as 

5 it acylates coli tRNA^*^^. 

P h e 

Yeast tRNA^^^^ was prepared in milligram 
quantities according to the anticodon-loop replacement 
procedure of Bruce and Uhlenbeck (Bruce and Uhlenbeck, (1982) 
Biochemistry 21 1 855 and 21:3921) (Fig. 5). This procedure 

10 involves removal of the three anticodon nucleotides G-34, A- 
35 and A-36 as well as the modified nucleotide Y-37 from the 
anticodon loop of yeast tRNA^*^®. The four excised 
nucleotides are then replaced with a chemically synthesized 
CpUpApA which includes the anticodon sequence required for an 

15 amber suppressor tRNA. 

The details of this modified procedure of Bruce and 
Uhenbeck (Bruce and Uhlenbeck, (1982) Biochemistry 21:855 and 
21:3921) are as follows. After the initial depurination step 
the tRNA-y was recovered by ethanol precipitation and treated 

20 with aniline hydrochloride. The cleavage products were 
recovered by EtOH precipitation and separated on a 
preparative denaturingr PAGE gel (8%, 0.015cm x 16cm x 42cm; 5 
mg of the crude tRNA was loaded onto each gel) . The bands 
were stained with 0.02% toluidine blue^ cut out and eluted 

25 with 2 X lOml. lOOmM NaOAc (pH 4.5) , ImM EDTA and 0.1% SDS. 
The stain was removed by extractions with phenol and CHCI3, 
the tRNA half -molecules were recovered by ethanol 
precipitation. For the partial nuclease digestion the 
concentration of the RNase A was increased to 2 ^g/niL, and 

30 following ethanol precipitation, the pellet was resuspended 
in sterile water and extracted with phenol, phenol: CHCl 3 
(1:1), CHCI3 reprecipitated with ethanol. The 

ribonucleotide tetramer CpUpApA was synthesized by sequential 
phosphotriester coupling of protected nucleosides (Jones et 

35 al., (1980) Tetrahedron 36:3015; Boom and Wreesman, (1984) 
"Oligonucleotide Synthesis", Gait Ed., IRL Press, 
, Washington) . The fully deprotected CUAA was ligated onto the 
3 • tRNA half-molecule using T4 RNA ligase supplied by Takara 
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Shuzo, The conditions for the ligation were 50 ATP, 19 
tRNA (both half-molecule s are present in the reaction) , 82 0 
ixVi CUAA and 50 U/mL T4 RNA ligase. Ligation reactions were 
typically carried out on 5 mg of the RNase A*digested tRNA in 
5 a reaction volume of 10 mL. The kinase treatment was carried 
out with 4 MM tRNA, 120 MM ATP and 50 U/mL of T4 
polynucleotide kinase (Richardson (1965) Proc , Natl - Acad , 
Sci, USA 54:158; and Midgley and Murray (1985) EMBO J, 
4:2695). The final ligation was done with 25 U/mL of T4 RNA 
10 ligase. 

The suppressor produced by this method is missing 
the 3 ' terminal pCpA aminoacyl acceptor stem. These 
nucleotides can be replaced using the tRNA repair enzyme 
nucleotidyl transferase (Cudny and Deutscher, (1986) J. Biol, 
15 Chem, 261:6450) to yield a full-length yeast tRNA^^^^^- 

suppressor tRNA can be aminoacylated Jji vitro with ["^HJ-Phe 
to levels of 30-35% (based on radioactivity incorporated into 
purified [^H]-Phe - tRNA^c^^u^A^ using a large excess of yeast 
PRS. Enzymatic mlsacylation reactions (300 /liL total volume) 
20 contained the following: 4 /xM tRNA^Q^^^^ <30^g, which had 

been desalted and lyophilized following gel purification) , 80 
/iM phenylalanine, 40 mM Tris-HCl (pH 8.5), 15 mM MgCl2r 45 
/xg/mL BSA, 3.3 mM DTT, 2 mM ATP and 22 Units yeast PRS (where 
1 unit activity incorporates 100 pmol Phe in 2 minutes at 
25 37 under the following conditions: 2 m tRNA^^® 

(Boehringer Mannheim), 2 mM ATP, 3.3 mM DTT, 8.1 AtM Phe, 40 
mM Na HEPES (pH 7.4), 15 mM MgCl2, 25 mM KCl, and 50 Azg/mL 
BSA) . The reaction mixture was incubated at 37''C for 3 
minutes, then quenched by addition of 2.5 M NaOAc (pH 4.5) to 
30 10% v/v. The quenched reaction was immediately extracted 
with phenol (pre-equilibrated with 0.25 M NaOAc, pH 4.5), 
phenol: CHCI3 (1:1)/ CHCI3, then precipitated with EtOH. The 
extraction and precipitation were repeated twice more. The 
tRNA was then desalted on a Pharmacia fast desalting colximn 
35 and lyophilized. the lyophilized mixture of acylated and 

non-acylated tRNA was stored at -80 °C until immediately prior 
to its use in in vitro protein synthesis reactions. 
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Under similar reacrbion conditions wild-type yeast 
tRNA acyiates to levels of 40-45% with yeast PRS. 
Attempts at separating acylated from non-acylated tRNA's by 
BD-cellulose chromatograpliy (Kreig et al,^ (1986) Proc. Natl. 
5 Acad. Sci, USA 83:8604; Wiedmann et al., (1987) Nature 

(London) 328:830; Johnson et al., (1976) Biochemistry 15:569; 
Baldini, et al., (1988) Biochemistry 27:7951; Heckler et al., 
(1984) Tetrahedron 40:87; and Heckler et al., (1984) 
Biochemistry 23:1468) resulted in poor separation and 

10 unacceptably low yields of acylated tRNA. The mixture of 

acylated and non-acylated tRNA's is therefore used directly 
in in vitro protein synthesis reactions. We have attempted 
to misacylate with several analogues of 

phenylalanine using yeast PRS^ however these eacperiments were 

15 unsuccessful under variations in pH^ concentration of buffer 
and/ or salt^ and concentration of organic solvents. 

Importantly^ yeast tRNA^^*^^^^ is not recognized by 
the coll aminoacyl-tRNA synthetases present in our in 
^^^^o system (Fig. 6) . An In vitro reaction primed with 

20 pF66am and non-acylated suppressor, in the presence of [^H] 
phenylalanine^ results in no /9-lactamase activity and no 
radioactive band of the correct molecular weight when 
analyzed on a denaturing polyacrylamide gel. A reaction 
primed with pF66aia and [^H]-Phe-tRNA^^^^j\, the level of In 

25 vitro ^-lactamase synthesis from pF66affi is 15-20% compared to 

that for pSG7, These results demonstrate that yeast 
P h e " 
C U A ^^®ts the design criteria outlined above: the 

tRNA is not enzymatically cuninoacylated or deacylated, and it 

inserts an acylated amino acid in response to UAG. 

A run-off transcript tRNA was produced (Sampson and 

Uhlenbeckr (1988) Proc, Natl, Acad. Sci> USA 85 r 103 3) 

corresponding to the sequence of yeast tMTA^^*^^®^ (-CA) which 

has been purified and ligated to phenylalanyl-pCpA (Suich and 

Noren, unpublished results) . Addition of this aminoacylated 

35 tRNA to in vitro protein synthesis reactions affords 50-65% 

of the /3-lactamase activity observed for reactions using the 

same amounts of Phe-tRNA^^^^®^ derived from anticodon loop 

replacement. This level of translational efficiency for the 
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run-Off suppressor compares favorably to data for a tRNA^-^^ 
constructed in a similar fashion (Samuelsson et al., (1988) 
J, Biol. Chem- 263sl392), 

5 Chemical Aminoacvlatlon 

As noted above, enzymatic misacylation by the 
aminoacyl-tSNA synthetases is not a general method due to the 
high specificity of these enzymes. Chemical misacylation, 
however, should be generalizable to any amino acid-like 
10 structure. Direct chemical acylation of an intact tRNA is 
not practical due to the large number of reactive sites in 
the macromolecule, Hecht and coworkers (Heckler et al., 
(1984) Tetrahedron 40:87; and Heckler et al., (1984) 
Biochemistry 23:1468) simplified this problem by chemically 
15 acylating the dinucleotide pCpA and enzymatically ligating it 
to the 3* terminus of a truncated tRNA [tRNA(-CA)] using T4 
RNA ligase to afford an aminoacyl-tRNA. This approach, 
though successful, suffered two major drawbacks: the a-amino 
protecting group was not removed, restricting the aminoacyl- 
20 tRNA to only act as a P site donor, and the chemical 
acylation yield was quite low. 

The general strategy for chemical acylation of pCpA 
involves carboxyl activation of an N-blocked amino acid 
followed by coupling via an ester linkage to the diol of the 
25 terminal adenosine (the 2 • and 3 • acyl groups rapidly 
interconvert in aqueous solution) . Aminoacylation is 
complicated by preferential acylation of the exocylic amino 
group of cytidine and 2\ 3" diacylation of adenosine. The 
a-amino protecting group greatly increases the stability of 
30 the aminoacyl ester linkage to hydrolysis 2md avoids 

polymerization during carboxyl activation Schubert and Pinck, 
(1974) Biochimie 56:383). However, the protecting group must 
be removed if the acylated-tRNA is to function as an A site 
donor. It has recently been shown by Brunner (Kreig et al., 
35 (1986) Proc, Natl. Acad. Sci. USA 83:8604; Wiedmann et al., 
(1987) Nature (London) 328:830; Johnson et al., (1976) 
Biochemis-bi-y 15:569; Baldini, et al., (1988) Biochemis-h-r-Y 
27:7951) that a«-amino protected aminoacyl pCpA can be 
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deprot:ect:ed before ligation to tBNA(-CA) without hydrolysis 
of the aminoacyl ester linkage. 

The scheme for aminoacylation of tRNA^^,^^®^ is 
outlined in Figure 7. A minimal protection scheme was used 
5 in which only the exocyclic amine of <^tidine was protected 
by 0-nitrophenyl sulfenyl chloride (NPS-Cl) . The o-amino 
group of the amino acid was also protected with the NPS 
group. NPS-pCpA was acylated with N-blocked Phe using N,N'- 
carbonyldiunidazole as the activating agent. The NPS 
10 protecting groups were removed from cytidine and the amino 

acid in high yield using aqueous thiosulfate (Lapidot et al., 
i^970) Biochem. Biophvs. Res. Comm. 38:559 and Heikkila et 
al., (1983) Acta. Chem. Seanti. Tj-il ) The 
acylation/deprotection was carried out in 14% overall yield 
15 (unpublished results indicate that 2», 3 • -diacylation is the 
major factor limiting acylation yields in model compounds), 
which compares favorably with the 3-4% yields of Hecht and 
Brunner (^eig et al., (1986) Proc. Natl. Acad, sci. nsA 
83:8604; Wiedmann et al., (1987) Nature f London^ 328:830; 
20 Johnson et al., (1976) Bioehemist-ry 15:569; Baldini, et al., 
f 1988) BiochemistT-y 27:7951; Heckler et al. , (1984) 
Tetrahedron 40:87; and Heckler et al., (1984) Biochemistry 
23:1468). However, Chladek (Happ et sa., (1987) j. Org. 

52:5387) has recently reported aminoacylation of 5'- 
25 CpCpA in 26% overall yield via an alternate strategy. 

Chemical acylation reactions (80^ total volume) 
contained the following: 600mM pCpA-Phe (40Mg) , 10 uM 
tENA (20Mg, which had been desalted and lyophilized 

following gel purification), 55 mM HEPES (pH 7.5), 250/xM ATP, 
15mM MgCl^, 20mg/ml. BSA, DMSO (to 10% v/v) and 200 units T4 
BHA ligase. The reaction mixture was incubated at 37 'C for 
12 minutes, quenched by addition of 2.5 K NaOAc (pH4.5) to 
10% v/v and treated as described above, but with only one 
round of extraction/precipitation. The di-NPS protected 
35 aminoacyl pCpA was also a substrate for T4 RNA ligase, but 
better yields of the aminoacyl tRNA were obtained by 
deprotection followed by ligation rather than ligation and 
siibsequent deprotection. 



30 
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Fully deprotected pCpA-Phe was ligated directly to 
tRNA^^^U®^(-CA) using T4 RNA ligase (note that the truncated 
suppressor tRNA is generated directly by the anticodon loop 
replacement method) . The yield of Phe-tRNA^^^^®^ is 35% 
5 based on analysis of 3H-Phe incorporation into the purified 
suppressor (gel electrophoresis indicates 80-90% of the 
tRNA^^*^^j®^ (-CA) is converted to material with the same 
mobility of tBNA^j,^^®^) . Using this procedure tRNA^^^^^^j^ (- 
CA) was also aminoacylated with D-phenylalanine (D-Phe) , (S)- 
10 p-nitrophenylalanine (p-N02'-Phe) , (S) -homophenylalanine (2- 

amino-4-phenylbutanoic acid, HPhe) , (S) -p-f luorophenylalanine 
(p-FPhe) , (S) -3-amino-2-ben2ylpropionic acid (ABPA) and (S)- 
2-hydroxy-3-phenylpropionic acid (PLA) (in this case no a- 
hydroxyl protection was used). These euninoacyl tRNA's were 
15 used in in vitro protein synthesis to synthesize mutant 
lactamases (vide infra) . Current efforts to optimize 
aminoacylation include the use of acid labile protecting 
groups and protecting groups that: can be removed by 
hydrogenation, as well as an investigation of the use of non- 
20 selective lipases for the aminoacylation of unprotected RNA. 
Protecting groups which can be removed by hydrogenolysis or 
acid treatment will also simplify protection of unnatural 
amino acid side chains. 



25 In vitro Suppression 

In vitro reactions primed with pF66am and 
supplemented with suppressor that had been enzymatically 
acylated with [^H] -phenylalanine (30% acylated) , to a final 
concentration of 167 fig/vkL, yielded 5.5-7.5 /ig/mL of active 

30 )9-lactamase, which represents 15-20% suppression efficiency 

(Fig. 8) . Suppressor that had been chemically acylated using 
pCpA-Phe (35% acylated) resulted in a yield of 2.8-7.5 /xg/mL. 
This is a sufficient yield for purification of the enzyme to 
near homogeneity from a 1 mL reaction in 7-15% overall yield 

35 (Fig. 8) using ammonitm sulfate precipitation followed by 
chromatof ocusing and anion exchange chromatography. 
Importantly, the k^^^ and for the purified /3-lactamase 
were identical to those of the wild-type enzyme produced jji 
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YitES (Figure 10) . Site-specific insertion of ^H- 
phenylalanine by [^Hj-Phe-tRNA^c^^^ /3-lactamase was 

by peptide mapping experiments, oaiere are twenty- 
seven putative trypsin cleavage sites, and five phenylalanine 
5 residues in TREM /3-lactamase (Sutcliffe, (1978) Proc. Natl. 
^cad. Scj USA 75:3737; Ambler and Scott (1978) Proc. Natl. 
Aca<^, Scj,. USA 75:3732; Pollitt and Zalkin, (1983) J^. 
■ BaPt^lgX' 153:27). These five phenylalanines are 
distributed in four of the twenty-eight tryptic fragments, 
10 with one eight-residue peptide containing both Phe66 and 

Phe72. ^-lactamase was synthesized ia vitro from pS67 in the 
presence of added [-^Hl-ph^ylalanine. The purified 
radiolabelled enzyme was digested with trypsin and the 
fragments were separated by reversed-phase FPLC (Fig. 9). 
15 Four discrete radioactive peaks were observed, in agreement 
with the locations of [%]-Phe in RTEM ^-lactamase 
(Sutcliffe, (1978) Proc. Na tl. Acad. Sci ttsa 75:3737; Ambler 
and Scott (1978) Proc. Nat l. Acad. Sci. psa 75:3732; Pollitt 
and Zalkin, (1983) J. BaefceT-ioi . 153:27; Fisher et al. , (198) 
20 BiPcngmtstn>- 19:2895 and Knowles (1985) Ace, chem. Po« , 

18:97>. The peak that elutes in fractions 44 and 45 contains 
twice as many counts as the other three peaks and is assigned 
as the peptide that contains F66 and P72. A similar analysis 
of tryptic peptides derived from /9-lactamase synthesized jji 
25 vitro from pF66affi in the presence of added [^H]-Phe- 
C u\ shows one radioactive peak. The fact that 
radioactivity elutes in fraction 44 in both the wild-type 
(pSG7) and suppressed (pSF66am) experiments taken together 
with the observation that in the wild-type experiments this 
30 peak contains twice as mucdi radioactivity as the others, 

strongly suggests that in the suppressed e3q>eriment [^H]-Phe 
Is inserted uniquely at the target site (P66) . 

The phenylalanine analogues D-Phe, HPhe, ABPA and 
PIA were each loaded onto suppressor tRNA as described above. 
35 In vjtro protein synthesis reactions carried out in the 

presence of [^^S] -methionine resulted in similar levels of 
radioactivity incorporated into trichloracetic acid (TCA)- 
precipitable material for the p-FPhe, p-N02Phe and HPhe 
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reactions. Kinetic analyses of the /S^lactamases synthesized 
in these reactions demonstrate similar K^*s but different 
^cat'® (Figure 10). Direct quantitation of the purified 
N02Phe and HPhe mutants was impossible, as both mutants lost 
5 activity during purification attempts. Consequently, ["^^S]- 
methionine incorporation and TCA precipitation were used to 
quantitate all the mutants for the purpose of direct 
comparison. Experiments with D-Phe, FLA and ABPA resulted in 
no detecteJsle ^-lactamase activity or protein synthesis. 
10 These results demonstrate that in the case of 

phenylalanine, analogues that differ in both steric and 
electronic properties can be substituted into proteins. 
Replacement of Phe66 by Tyr, which differs from Phe both 
sterically (4-OH group) and electronically (-NO2 is a good tt- 

15 electron donor) , leads to an approximate twofold decrease in 
k^^l- with little effect on K^. Similarly, replacement of 
Phe66 by p-nitrophenylalanine, which again differs both 
sterically (4-NO2 group) and electronically (-OH is a good tt- 
electron acceptor) , leads to an approximate twofold decrease 

20 in K^g^^ with little effect on K^. On the other hand, 

replacement of Phe66 by p-f luorophenylalanine, which is both 
sterically and electronically similar to Phe, leads to a 
slight increase in l^^g^^ ^^"'^^ effect on i^. Replacement of 
Phe66 with homophenylalanine, which is electronically 

25 identical to Phe but substantially different sterically, 
leads to an approximate sixfold decrease in and an 

increase in K^. Both the HPhe and p-N02Phe mutants, which 
correspond to the greatest steric perturbation, are unstable 
and presumably unfold and proteolyze druring purification 

3 0 attempts . 

Attempts to alter the protein backbone by replacing 
the sonide linkage with an ester linkage (PIA) , adding an 
additional methylene group into the backbone (ABPA) , or 
changing the sterochemistry of the a-carbon (D-Phe) lead to 
35 no detectable synthesized protein or enzymatic activity. It 
is not clear at present whether these results stem from 
impaired protein folding or stability (Phe66 is adjacent to a 
proline residue) or from insdsility of these amino acids to 
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function as ribosomal A site acceptors, it has been reported 
that PIA is incorporated into the aiaino terminal position of 
polyphenylalanine (Herve and Chapevill, (1965) J, Mol, Biol, 
13:757) and that N-acetyl-D-phenylalanine functions poorly as 
5 a P-site donor in response to a poly-(U) message (Rosesser et 
al. (1986) Biochemistiirv 25:6361 and Heckler et al. , (1983) J\. 
. Biol^ cn^m. 258:4492). Yamane et al. report that K^^ss 
the D-Tyr-tRNA - EF-Tu ternary complex is 25-fold higher than 
for the L-Tyr-tRNA - EF-Tu ternary complex (Yamane et al. , 
10 (1981) Biochemistirv 20:7059). The same level of 

stereochemical selectivity for ternary complex formation 
between EF-Tu. and D-Phe-tRNA^^,^^®^ would result in 
substantial hydrolysis of the aminoacyl ester during in vitro 
reactions as well as poor EF-Tu mediated binding of the 
15 aminoacyl tRNA to the ribosome. 

Sufficient protein can be purified to characterize 
the catalytic constants and specificity of the mutants, to 
carry out limited mechanistic and mapping studies and to 
probe protein structure with techniques such as ESR and 
20 fluorescence spectroscopy. Improvements in in vitro protein 
synthesis, methods for tRNA generation, and tRNA 
aminoacylation chemistry will permit production of milligram 
quantities of mutant proteins via this strategy. 

From the foregoing, it will be appreciated that the 
25 present invention provides improved means for producing 
modified proteins. The methods are rapid, simple and 
. universal in utility. 

All publications and patent applications mentioned 
in this specification are indicative of the level of skill of 
30 those skilled in the art to which this invention pertains. 
All publications and patent applications are herein 
incorporated by reference to the same extent as if each 
individual publication or patent application was specifically 
and individually indicated to be incorporated by reference. 
35 The invention now being fully described, it will be apparent 
to one of ordinary skill in the art that many changes and 
modifications can be made thereto without departing from the 
spirit or scope of the claims. 
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WE CLAIM; 

1. A method for sl'te specifically incozpora1:ing 
5 an unna1:ural amino acid analogue int:o a pro1:ein, said metJiod 
comprising: 

(a) in'kroducing a preselected codon into at least 
one site in a mSNA sequence encoding the protein; and 

(b) translating the mRNA sequence in a protein 
10 synthesizing system comprising an aminoacyl tRNA analogue 

capable of polymerizing the unnatural amino acid analogue 
into a nascent polypeptide chain at the direction of the 
preselected codon. 

15 2. A method of Claim 1, wherein the protein 

synthesizing system is an jji vitro protein synthesizing 
system. 

3. A method of Claim 2, wherein the in vitro 
20 protein synthesizing system is prepared from: 



codon is a translation termination codon. 

5. A method of Claim 4, wherein the preselected 
30 codon is a UAG (eunber) codon. 



a) 
b) 
c) 



coli; 
S . cerevislae ; 
wheat germ; or 
rabbit reticulocyte. 



25 



4. 



A method of Claim 1, wherein the preselected 



6. A method of Claim A, wherein the preselected 
codon is inserted at predetermined sites. 
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7. A method of Claim 1^ wherein tJie unnatural 
amino acid analogue is one selected from the group of; 

i) modified natural amino acids; 

ii) modified uncharged amino acids; 
5 - iii) modified acidic amino acids; 

iv) modified basic amino acids; 
V) non-alpha amino acids; 

vi) amino acids with altered ^, 0 angles; and 

vii) amino acids containing functional groups 
10 selected from the group of nitro, amidine/ hydroxylamine, 

quinoiie, aliphatic, cyclic emd unsaturated chemical groups. 

8. A method of Claim 1, wherein the aminoacyl 
tPNA analogue is the only aminoacyl tRNA molecule in the 

15 protein synthesizing system capsible of recognizing the 
preselected codon. 

9-. A method of Claim 1, wherein the preselected 
CQdon is introduced into one site of the mRNA sequence 
20 encoding the protein. 

10. A method of Claim 1, wherein the protein has a 
molecular weight greater than about ten thousand daltons. 

2^ 11. A method of Claim 1, wherein the unnatural 

amino acid analogue is situated within eUsout 100 angstroms 
of: 

a) a substrate binding site; 

h) an enzymatic active site; 

30 c) a protein-protein interface; 

d) a cof actor binding site; or 

e) a ligand (agonist or antagonist) binding site. 



12. A protein synthesized by the method of Claim 
35 1, 8 or 10. 
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13 - A protein according to Claim 12 , wherein the 
protein is substantially stoichiometrically substituted at 
one or more predetermined sites. 

5 14. A protein according to Claim 12, wherein the 

protein is substantially homogeneous in its substitutions at 
one or more predetermined sites. 

15. A protein according to Claim 13, wherein the 
10 protein is svibstantially homogeneous in its substitutions. 

16. A method for determination of physical, 
chemical or biochemical properties of a protein, such method 
comprising the steps of: 

15 a) synthesizing a substantially pure protein 

stoichiometrically substituted at specific sites by the 
method of Claim 1, 8 or 10; and 

b) analyzing the physical or biochemical 
properties of the protein. 

20 

17. A method of Claim 16, wherein the protein is 
analyzed by X-ray crystallography or NUR. 

18. A method of Claim 16, wherein analyzing the 
25 physical or biochemical properties of the protein determines: 

a) static physical properties of the polypeptide 

chain; 

b) mechanism of action of an enzymatic reaction; 

c) specificity of protein binding to ligand; 

30 d) dynamic interaction of amino acid residues of 

a subject protein with a substrate; 

e) folding of the protein; or 

f ) interaction of the protein with other 
proteins, with nucleic acids or with sugars. 



35 



19. A method of Claim 16, wherein the protein is 
analyzed within about 100 angstroms of the unnatural amino 
acid analogue. 
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20. A method for making multiple alternative 
stibstitutions at preselected amino acid positions of a 
protein comprising; 

5 a) producing one mRNA with mistranslation codons 

at sites in the mRNA corresponding to the preselected amino 
acid pes it ions ; and 

b) translating the mRNA in a series of two or 
more translation systems each comprising an aminoacyl tRNA 
10 analogue r whereby the protein produced by one translation 

system differs from the protein produced by another system at 
the preselected amino acid position. 

21. A method of Claim 20, wherein the difference 
15 between proteins produced by the different translation 

systems is predetermined by the preselection of lonnatural 
amino acid analogues attached to the aminoacyl tRNAs. 

22. A method of Claim 20, wherein the protein has 
20 a molecular weight greater than about ten thousand daltons. 

23. A method of Claim 20, wherein one amino acid 
position is substituted. 

24. A method of Claim 20, wherein the unnatural 
amino acid substitution is selected from the group of r 

i) D-phenylalanine; 

i i ) ( S ) -p-^nitropheny lalanine ; 

iii) (S)-homophenylalanine; 

2^ . iv) CS)-p-fiuor6phenylalanine; 

V) (S)-3-amino-2-ben2ylpropionic acid; and 
vi) (S) -2-hydroxy-3-phenylpropionic acid. 

25. A method of Claim 20, wherein the 

35 mistranslation codoh is a translation termination codon. 



26. A method of Claim 25, wherein the translation 
termination codon is a UAG (amber) termination codon. 



wo 90/05785 



51 



PCr/US89/0S256 



27. A method for producing an aminoacyl tRNA 
analogue molecule, such method comprising the steps of: 

a) attaching a predetermined unnatural amino acid 
5 analogue by an aminoacyl linkage at 2 ' or 3 * ribosyl hydroxyl 

positions on the 3* terminal nucleotide of a multi-nucleotide 
molecule (MNH) ; and 

b) ligating the aminoacyl -multi-nucleotide 
molecule ( aminoacyl -MNM) to a truncated tRNA molecule 

10 (tPNA(-Z)), wherein a functional aminoacyl tPNA analogue 
molecule is formed. 

28. A method of Claim 27, wherein the multi- 
nucleotide molecule (MNM) corresponds to a tBNA 3* terminus. 

15 

29. A method of Claim 27 or 28, wherein the multi- 
nucleotide molecule (MNM) is a dinucleotide. 

30. A method of Claim 29, wherein the dinucleotide 
20 is 5»-pCpA-3'. 

31. A method of Claim 27, wherein ligation of the 
multi-nucleotide molecule (MNM) to the tRNA(-Z) molecule 
generates a complete tRNA molecule. 



25 



32. A method of Claim 27, wherein the tRNA(-Z} is 
derived from a run-off transcript. 



33. A method of Claim 27 or 30, wherein the 
30 attaching of the predetermined unnatural amino acid analogue 
by an aminoacyl linkage at 2 or 3 * ribosyl hydroxyl positions 
on the 3 * terminal nucleotide of a multi-nucleotide molecule 
(MNM) is accomplished by the steps of: 

a) protecting reactive chemical groups of the MNM 
35 with protective agents; 

b) protecting reactive non-aminoacyl reactive 
groups of the amino acid analogue with a blocking agent; 
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c) acylating the MNH with a blocking agent- 
protected amino acid analogue; and 

d) removing the protective agents and blocking 
agents from the protected reactive sites. 

5 . 

34. A method of Claim 33, wherein some or all of 
the reactive group protecting steps are substituted with 
steps using blocking or protective agents selected from the 
group consisting of: 

a) o-nitrophenylsulfenyl (NSP) ; 

b) ^-cyanoethyl (EtCNO) ; 

c) benzylo3^carbonyl (CBZ) s 

d) 9-fluorenylamethyloxycarbonyl (FMOC) ; 

e) 2-(4-biphenyl) isopropyloxycarbonyl (BPOC) ; 

f ) vinyl oxycarbonyl CVOC) ; 
.9:) tetrahydropyranyl (THPJ ; 

h) methoxytetrahydropyranyl f emd 

i] photolabile groups. 

2° 35. A method of Claim 33, wherein the protecting 

steps are performed using o-nitrophenylsulfenyl (NPS) for 
both the blocking agents and protective agents. 



25 



30 



35 



36. A method of Claim 27, wherein the ligating of 
the aminoacyl-MNM to the tRNA(-Z) is performed by the enzyme 
T4 RNA ligase. 

37. An aminoacyl tRNA analogue molecule having; 
a) the formula X - A - y - m, wherein: 

X = 5' nucleotide sequence of a tRNA molecule; 
A =f anticodon nucleotides; 

Y = 3« nucleotide sequence of a tRNA molecule; 
M «= amino acid analogue selected from the 
group consisting of: 

i) modified unchaxxfed natural 
amino acids; 
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11} modified acidic natural amino 

acids ; and 
111) non-alpha amino acids; and 
b) activity to direct the polymerization of the M 
5 component Into a nascent polypeptide chain and which can 
serve as an acceptor for further peptide polymerization • 

38. A molecule of Claim 37, wherein the Y 
component has a 3 • terminus of 5 • -pCpCpA-3 • . 

10 

39. A molecule of Claim 38, corresponding to 
tENA^^jj^j^ amlnoacylated with (S) -p-nltrophenylalanlne, 
wherein: 

a) X comprises the 5 • segment of tRNA^^*^^®^ 
15 containing a "D loop" and part of an "antlcodon loop"; 

b) A (antlcodon) comprises the 
trinucleotide 5 • -pCpUpA-3 • ; 

c) Y comprises the 3« segment of tRNA^^*^^®^ 
containing part of an "antlcodon loop", a "variable loop", a 

20 "TOc loop", and an "acceptor stem"; and 

d) M is (S)-p-nitrophenylalanine, 

40. A translation system comprising an euninoacyl 
tRNA analogue molecule of Claim 37. 



25 



30 



41. A coupled transcription and translation system 
wherein products of the transcription system are translated 
by a translation system comprising an aminoacyl tRNA analogue 
of Claim 37. 

42. A substantially homogeneous protein of greater 
than about 10,000 daltons, wherein an unnatural amino acid 
has been stoichlometrically substituted at specific sites. 



35 
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